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FOREWORD 


The  first  in  this  series  of  conferences  had  as  its  host  the  Office 
of  Ordnance  Research  (now  the  Army  Research  Office).  It  was  held  in 
Durham,  North  Carolina,  in  late  1962,  and  was  entitled  the  "ARO  Working 
Group  on  Computers."  It  retained  this  title  for  only  two  meetings;  then 
the  name  was  changed  to  the  "Army  Numerical  Analysis  Conference. 11  Re- 
cently the  name  received  a modification.  The  title  is  now  the  "Army 
Numerical  Analysis  and  Computers  Conference."  This  new  designation 
emphasizes  both  phases  of  these  meetings. 

The  host  for  the  initial  conference  also  served  as  host  for  the 
present  conference--the  thirteenth  in  this  series  of  meetings.  The 
Army  Mathematics  Steering  Committee  (AMSC)  continues  to  be  their  spon- 
sor. Members  of  this  committee  would  like  to  thank  Dr.  Paul  Boggs  for 
serving  as  Chairman  on  Local  Arrangements.  He  did  an  outstanding  job  in 
carrying  out  the  many  tasks  associated  with  conducting  a conference  of 
this  size. 

"The  Impact  of  Mini -Computers  and  Micro-Processors  on  Scientific 
Computation  in  Army  Research  and  Development"  was  the  theme  of  the  1976 
conference.  A Panel  Discussion  in  this  area  was  one  of  the  outstanding 
features  of  this  meeting.  It  was  chaired  by  Professor  David  J.  Farber 
of  the  University  of  California  at  Irvine.  The  four  members  of  his 
panel  were  Dr.  E.  David  Crockett,  Hewlett-Packard's  Data  Systems  Division, 
Professor  E.  J.  Desautels,  University  of  Wisconsin,  Dr.  Ivan  Sutherland, 
Rand  Corporation  and  Mr.  Eric  Wolf,  Bolt  Beraneck  Newman. 

The  keynote  address  was  delivered  by  Dr.  E.  David  Crockett.  He 
titled  his  talk  "Is  the  Mini-Computer  the  Next  Dinosaur?"  Another 
address  which  was  also  closely  related  to  the  theme  of  the  conference 
had  as  its  title  the  "Evolution  of  Micro-Computer  Technology."  It 
was  delivered  by  Dr.  Evan  Sutherland.  Two  other  featured  speakers  were 
Dr.  Achi  Brandt,  IBM  Thomas  J.  Watson  Research  Center,  and  Professor 
Gene  H.  Golub,  Stanford  University.  The  respective  titles  for  their 
addresses  were  "Multi-Level  Adaptive  Techniques  (MLAT)  for  Discretizing 
and  Solving  Partial  Differential  Boundary  Value  Problems"  and  "Least 
Square  and  Robust  Regression."  Members  of  the  AMSC  would  like  to  extend 
their  thanks  to  the  above-mentioned  panelists  and  invited  speakers  for 
sharing  with  members  of  the  audience  their  knowledge  about  new  numerical 
analysis  techniques  and  new  developments  in  the  computer  field.  Also, 
they  wish  to  thank  those  scientists  presenting  the  thirty-three  con- 
tributed papers.  Without  their  input  to  this  meeting  it  could  not  have 
fulfilled  its  full  significance  as  an  Army  conference. 

The  responsibility  for  organizing  these  symposia  rests  in  the  hands 
of  the  AMSC  Subcommittee  on  Numerical  Analysis  and  Computers.  Its  chair- 
man, Dr.  Ronald  p.  Uhliq,  hold  an  open  meeting  of  this  subcommittee  on 
the  last  day  of  this  symposium.  Among  the  topics  brought  up  for  discussion 
was  the  theme  for  the  1977  conference.  After  considerable  interplay  of 
ideas  and  many  suggestions,  the  theme  receiving  the  strongest  indorse- 
ments was  entitled  "Numerical  Techniques  for  Solutions  of  Nonlinear 
Partial  Differential  Equations,"  We  are  pleased  to  be  able  to  announce 
that  the  Mathematics  Research  Center,  University  of  Wisconsin  at  Madison, 
Wisconsin,  will  serve  as  the  host  of  this  coming  conference. 
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ADDENDUM 

An  invited  general  lecture  entitled, 
"Evolution  of  Micro-Computer  Technology" 
will  be  delivered  by  Dr.  Ivan  Sutherland 
of  the  Rand  Corporation,  Santa  Monica, 
California  at  1930  on  Wednesday  evening. 
This  talk  will  be  followed  by  the  panel 
session  as  announced. 
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A CASE  HISTORY  EXPLORING  THE  TRANSPORTABILITY  OF  A MATHEMATICAL 
ALGORITHM  FROM  A LARGE-SCALE  COMPUTER  TO  A MICROCOMPUTER 


S.  Kravitz  and  J.  A.  Hauser 


Aeronutronic  Ford  Corporation 
Western  Development  Laboratories  Division 
Paid  Alto,  California 


Abstract 


Problem-solving  techniques  as  adapted  to  microcomputers  compare 
in  varying  degrees  with  those  used  with  large-scale  computers  and 
minicomputers.  The  latest  hardware  technology  has  re-exposed  some 
of  the  same  numerical  and  programming  problems  that  were 
attendant  upon  the  introduction  of  minicomputers  in  replacement  of 
large-scale  computers.  A case  history  of  a typical  microprocessor 
application  is  presented,  with  emphasis  on  algorithm  transport- 
ability and  microprocessor  restrictions  and  capabilities.  The 
illustrative  example  is  a Graphics  Plotter  controlled  by  an  Intel-8080 
microprocessor.  Recommendations  for  future  studies  in  algorithm 
standardization  are  presented. 

A.  INTRODUCTION 


Technological  developments  in  the  field  of  microprocessors  within  the  past  few  years  have 
revolutionized  the  electronics  industry.  Microprocessor  prices  have  been  drastically 
reduced  recently,  making  it  virtually  impossible  to  ignore  them  as  cost-effective  alterna- 
tives to  minicomputers  and,  in  some  applications,  large-scale  computers. 

Use  of  a digital  computer  for  problem  solving  is  typically  divided  into  the  formulation, 
algorithm  development,  and  programming  phases.  Problem  solving  can  be  viewed  as  a 
team  effort  involving  a numerical  analyst,  programmer,  and  electronic  engineer.  Micro- 
processor-based applications  generally  involve  some  electronic  fabrication  which  includes 
circuit  design  and  integration  of  input/ output  sensors. 
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The  computer  architecture  must  be  taken  into  account  in  each  development  stage.  Micro- 
processor applications  differ  from  large-scale  or  minicomputer  applications  in  the  areas 
of  limited  arithmetic  capability,  lack  of  adequate  support  software,  and  in  the  require^ 
meats  of  solving  real-time  data  acquisition  problems. 

The  use  of  large-scale  computers  for  microprocessor  software  development  is  an 
accepted  technique.  Proofing  and  simulating  algorithms  before  developing  them  for  the 
more  difficult  microprocessor  environment  is  made  possible  through  the  use  of  cross  soft- 
ware, such  as  compilers,  assemblers  and  simulators,  which  operate  on  a host  computer. 
However,  transporting  these  algorithms,  or  those  already  in  use  on  large-scale  computers, 
still  presents  numerical  and  software-language  difficulties. 

The  present  day  use  of  large  computers  to  solve  engineering  problems  in  general  is  im- 
plemented in  high-level  languages  such  as  FORTRAN.  The  use  of  a high-level  language 
has  certain  drawbacks,  not  the  least  of  which  is  an  abstraction  (or  transparency  effect) 
which  masks  the  nature  of  the  actual  computer  arithmetic  operations.  In  using  micro- 
computers, however,  the  practitioner  must  be  keenly  aware  of  the  hardware  implementa- 
tion of  the  arithmetic  operations  and  the  numerical  problems  attendant  on  small  word 
sizes. 

A typical  microprocessor  system  generally  consists  of  the  following  system  components: 

CPU  - The  Central  Processing  Unit  controls  the  communications  between 
memory  and  the  input/output,  keeps  track  of  the  program,  and  operates  on 
instructions  via  the  ALU  (Arithmetic  Logic  Unit). 

MCU  - The  Memory  Control  Unit  controls  which  memory  chip  is  accessed 
by  the  CPU.  A decoder  is  often  used  for  this  purpose. 

DCU  - The  Device  Control  Unit  selects  the  input/output  accessed  by  the  CPU. 
hi  general,  these  are  the  selected  port  addresses. 

Memory  — Most  microprocessors  employ  both  ROM  (Read  Only  Memory)  and 
RAM  (Random  Access  Memory). 

System  Clock  - Although  some  micros  now  have  on-chip  clocks,  many  still 
require  an  external  clock  chip  for  system  timing. 
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Interface  Chip  - The  interface  chip  is  a register  (either  programmable  or  not) , 
controlled  by  the  CPU,  and  is  used  to  interface  to  the  outside  world. 

Microprocessor  applications  fall  into  the  following  general  categories;  controllers, 
terminals,  communication  equipment,'  and  consumer  products.  Specifically,  micro- 
processors are  being  used  in  point-of-sale  terminals,  onboard  vehicle  control,  banking, 
recreational  games,  industrial  controls,  time- sharing,  remote  batch,  and  numerous  other 
applications. 

B.  ALGORITHM  TRANSPORTABILITY  CONCEPTS 
B.  1 Algorithm  Development 

For  this  discussion,  algorithms  are  those  numerical  methods  which  are  used  to  solve 
problems  on  digital  computers.  By  definition,  they  are  completely  unambiguous,  and 
should  include  an  error  analysis.  The  error  analysis  includes  accuracy  requirements, 
estimation  of  round-off  and  discretization  error,  step-size  and  iteration  counts,  and 
non-convergence  allowances.  For  the  context  of  this  paper,  algorithms  include  pro- 
gramming considerations  which  are  necessary  for  software  implementation. 

A much- sought-after  objective  in  scientific  computer  applications  is  transportability  of 
algorithms.  Transporting  an  algorithm  involves  conversion  and  translation  of  the  computer- 
dependent  features  so  that  the  algorithm  can  be  implemented  in  software  on  a different 
computer.  Computer  algorithms  have  been  developed  during  the  last  two  decades  at  great 
cost  and  unfortunately,  with  duplication  of  effort  in  conversion  to  newer  and  different  hard- 
ware. Some  earlier  large  computers  used  for  technical  problem  solving  had  36-bit  word 
architecture  (with  some  exceptions,  notably  the  Philco  2000  series,  48-bit  word);  32-bit 
and  60-bit  word  machines  were  adapted  later.  Minicomputers,  arriving  somewhat  later  on 
the  scientific-applications  scene  generally  used  16-bit  words,  although  this  also  varied. 
More  recently,  microcomputers  became  popular.  These  include  2,  4,  8,  12,  and  16-bit 
architectures,  m general,  the  use  of  floating-point  arithmetic  hardware  has  been  confined 
to  large-scale  computers  and  some  minicomputers. 

With  a diversity  in  number  representation,  arithmetic  capability,  and  software  languages, 
it  is  no  wonder  that  the  goal  of  algorithmic  transportability  seems  no  closer  in  the  micro- 
computer era  than  it  did  in  the  minicomputer  era.  The  practical  development  of  a numerical 
algorithm  involves  an  analysis  which  considers  not  only  computer  word  size  and  ..rithmetic; 
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but  also  vagaries  of  software  languages,  software  library  functions,  and.  In  the  case  of 
microcomputers,  the  pecularities  of  real-time  data  acquisition  - notably  digitization. 

The  use  of  numerical  analysis  techniques  for  real-time  microcomputer  usage  differs  in 
some  respects  from  that  used  in  defining  a large-scale  computer  algorithm.  Memory  and 
speed  constraints  restrict  the  use  of  extended- word  arithmetic  and  floating-point  software. 
The  trade-offs  are  not  always  simple  nor  are  the  alternatives  clearly  defined. 

The  extensive  algorithmic  and  software  literature  represents  a treasure  house  of 

ingenious  techniques  developed  with  great  effort  and  expense.  The  future  usefulness  of 
microcomputers  to  solve  problems  heretofore  reserved  for  larger  and  more  expensive 
machines,  depends  upon  the  conversion  techniques  adopted.  They  also  depend  on  the 
software  aids  developed  to  emulate,  if  necessary,  the  larger-word  machines  for  which 
the  algorithms  were  originally  developed. 

B.  2 Software  Development 

The  general  acceptance  of  FORTRAN  as  the  standard  in  technical  software  development 
has  been  aided  by  the  efforts  of  the  X3J3  FORTRAN  committee^.  Unfortunately  for 
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users  of  mathematical  algorithms,  standardization  is  an  objective  still  to  be  attained.  The 
differences  in  machine  architecture  must  be  taken  into  account  in  defining  an  algorithm. 

This  is  especially  true  in  the  use  of  microprocessors.  The  relative  newness  of  this 
technology  and  its  application  in  general  problem  solving  indicates  that  little  in  the  way  of 
software  support  tools  are  available.  The  first-time  user  is  cautioned  against  assuming 
the  existence  of  any  extensive  mathematical  library  supplied  by  the  manufacturer. 

A purchaser  of  a microcomputer  system  can  obtain  a manufacturer-  supplied  operating 
system,  assembler  programs,  and,  in  some  cases,  higher- level- language  compilers. 
Unlike  the  minicomputer  environment,  suppliers  generally  do  not  furnish  a compre- 
hensive mathematical  library  which  is  fully  tested  and  warranted.  However,  user 
groups  are  being  formed,  and  programs  are  being  exchanged  on  an  informal  basis.  Faced 
with  the  task  of  implementing  an  algorithm  on  a microprocessor,  the  development  team 
must  plan  to  convert,  or  transport  and  implement,  existing  algorithms  (sin,  cos,  etc. ) 
from  larger  machines. 

High-level- language  compilers  have  been  developed  as  an  alternative  to  machine  language. 
Programs  take  less  time  and  are  easier  to  write  than  their  assembly  language  courier  parts. 
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The  Intel  PL/M  cross  compiler^  is  an  early  example.  Other  support  programs  are 
being  announced  in  technical  publications;  ^ however,  there  appears  to  be  little  in  the 
way  of  standardization. 

Figure  1 illustrates  a typical  microprocessor  software  development  cycle.  The  develop- 
ment team  of  numerical  analyst,  programmer,  and  electronic  engineer  must  interface 
throughout  the  cycle  to  ensure  a valid  and  verifiable  implementation. 


Figure  1.  Microprocessor  Software  Development  Cycle 
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c.  MICROPROCESSOR  RESTRICTIONS  AND  IMPLICATIONS  FOR  ALGORITHM  DEVELOPMENT 
C.  1 Hardware  Restrictions 

Unlike  the  large  scale  computer,  the  microprocessor  is  restricted  in  its  operation  by 
various  inherent  parameters.  Although  nomenclature  differences  exist  between  micros, 
minis  and  large-scale  units,  a meaningful  comparison  can  be  made. 

Word  size  - The  standard  microprocessor  word  lengths  vary  from  2 bits  to  16  bits.  However, 

8 bits  seems  to  have  become  the  industry  standard.  Compared  to  32-bit  or  36-bit  words 
used  in  many  large-scale  computers,  the  limitations  are  clear.  Although  multiword 
definitions  increase  microprocessor  software  capability  and  accuracy,  coding,  memory, 
and  speed  restrictions  limit  their  use. 

Speed  - The  instruction  time  of  microprocessor  commands  depends  primarily  on  the 
technology  of  the  unit.  Whereas  the  more  popular  microprocessors,  such  as  the  Intel  8080 
and  Motorola  M6800,  use  NMOS  technology  and,  therefore,  have  slow  instruction  times 
(approximately  2 ps),  bipolar  microprocessors  are  becoming  more  popular  and  less 
expensive.  These  units  offer  much  faster  instruction  times  (approximately  10  to  100  times) 
at  the  expense  of  more  power  and  a larger  number  of  chips  per  system. 

Memory  restrictions  - The  amount  of  memory  a microcomputer  system  can  handle  is 
restricted  by  the  amount  of  addressing  available.  Many  micros  use  a 16-bit  address  bus, 
enabling  a micro  to  have  a capacity  of  65K  memory.  In  addition  to  memory  restriction, 
semiconductor  memory  access-time  restrictions  can  become  important  in  real-time 
applications. 

Digital  to  Analog  Converter  - The  restrictions  due  to  Digital-to-Analog  Converters  (DAC's) 
are  mentioned  here  because  of  their  implications  in  the  design  example.  DAC's  are  specified 
by  resolution  (number  of  bits  being  converted)  as  well  as  type  of  technology.  The  higher  the 
resolution  the  smaller  the  increment  of  output  voltage  each  bit  represents. 

W <2"“vref> 

where  n represents  the  resolution  in  bits  for  unipolar  operation. 
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The  design  must  also  take  into  account  the  settling  time  of  the  DAC.  This  is  the  time  re- 
quired for  the  output  function  to  settle  within  1/2  LSB  for  a given  digital  input  stimulus. 
Various  other  specifications  are  important  in  the  determination  of  the  accuracy  of  the 
converted  number. 

C.  2 Software  Restrictions 

Microprocessor  software  can  be  subdivided  into  two  major  categories:  microprogramming, 
and  fixed  instruction  set  programming.  Nearly  all  bipolar  microprocessors  are  micro- 
programmable  (a  user-defined  instruction  set,  utilized  at  the  fundamental  register  transfer 
level).  Most  MOS  microprocessors  are  not  microprogrammable  and  have  a fixed  instruction 
set.  The  more  popular  Intel  8080  and  Motorola  M6800  fall  into  this  later  category.  Due  to 
general  size  and  speed  constraints,  microprocessor  instruction  sets  are  not  as  extensive 
as  their  large-scale  computer  counterparts. 

Many  microprocessor  vendors  are  now  making  available  a higher  level  language.  The  more 
popular  ones  are  PL/M  and  FORTRAN,  with  newer  ones  such  as  MPL  (Motorola)  soon  to 
be  released.  Although  these  languages  alleviate  some  of  the  problems  of  assembly  language 
programming,  they  create  a lot  of  their  own.  Some  of  the  problems  of  assembly  language 
and  higher  level  language  are  as  follows: 

Multiply/Divide  - Most  fixed  instruction  microprocessors  do  not  have  a multiply  or  divide 
instruction.  Some  newer  microprocessors  consider  this  problem  and  either  have  such 
instructions,  or  have  a hardware  multiply /divide  circuit;  however,  many  algorithms  have 
been  written  to  compensate  for  this  oversight. 

Fixed  Point  Arithmetic  - Several  higher  level  languages  such  as  PL/M  do  not  incorporate 
floating-point  arithmetic.  Although  floating-point  subroutines  have  been  written  and  can 
be  called  as  part  of  the  program,  a definite  disadvantage  both  in  flexibility  and  time  is 

recognized. 

Unsigned  Arithmetic  - An  annoyance  one  must  keep  track  of  in  PL/M  is  the  minus  sign  in 
comparisons.  In  the  case  of  an  A>B  check,  a negative  A which  has  an  absolute  value  greater 
than  B would  appear  to  be  greater  than  B,  whereas  it  is  really  less  than  B. 

In  real-time  data  acquisition,  the  special  purpose  data  gathering  hardware  is  usually 
designed  and  constructed  simultaneously  with  the  algorithm  development.  Thus, 
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simulation  tools  must  be  used  to  verify  the  algo  'ithms.  Especially  useful  are  simulation 
programs  that  run  on  large-scale  computers  an  l that  allow  software  to  be  checked  out 
before  final  checkout  on  the  candidate  hardware  Testing  on  the  candidate  hardware  will 
t>wn  verify  that  sensors  and  conversion  devices  are  correct  and  adequate.  The  Intel 
Ihtellec  microcomputer  system  permits  input/o  itput  through  a straightforward  PL/M 
command  that  directly  connects  the  application  >rogram  to  a designated  hardware  input 
port.  To  output  a specific  value,  the  following  command  is  used: 

OUTPUT  (POET  NUMBEE)  = OU  TPUT  VALUE 

D.  ILLUSTRATIVE  EXAMPLE 

The  development  of  a microprocessor- controlled  plotter  to  draw  two-dimensional  ellipses, 
although  simple,  is  illustrative  of  some  of  the  problem  areas  in  algorithm  transportability. 

U.l  Problem  Statement 

Given  the  major,  minor  axis  lengths  (a,b),  draw  a smooth  ellipse  using  a microprocessor- 
controlled  analog  plotter.  This  problem  is  trivial  given  a modern  computer  and  supporting 
software.  However,  as  will  be  shown,  the  microcomputer  solution  is  not  trivial,  certainly 
not  for  a first-time  microprocessor  user  who  has  become  accustomed  to  large  machine 
support. 

Figure  2 is  an  artist’s  rendition  of  the  final  objective,  a portable  microprocessor  based, 
plotting  system. 

D.  2 System  Design 

Figure  3 illustrates  the  system  solution  to  the  ellipse  plotter  problem.  The  interactions 
between  algorithm,  software,  hardware,  and  human  factors,  influenced  the  design. 

D.  3 Hardware 

The  hardware  required  for  the  Graphics  Plotter  can  be  divided  into  two  sections:  (1)  the 
development  hardware,  and  (2)  the  prototype  hardware.  The  development  hardware  consists 
of  a system  such  as  the  Intel  Intellec  8/Mod  80  with  the  I/O  connected  to  two  10-bit  DAC's 
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and,  subsequently  through  appropriate  amplification,  to  the  X and  Y inputs  of  an  X-Y 
recorder.  Data  entry  is  handled  through  a teletype. 

The  portable  prototype  system  employs  an  Intel  8080A  microprocessor  connected  to 
several  256  X 8 ROM's  and  256  X 4 RAM's,  fii  addition,  a programmable  I/O  interface 
chip  handles  the  input  and  output,  a clock  generator  and  crystal  handle  the  system  timing, 
a l-of-8  decoder  does  the  device  control  selection,  and  a USART  is  used  for  data 
communications  through  a current  loop  or  RS232  type  interface.  A system-controller  chip 
handles  the  proper  timing  signals  to  the  rest  of  the  system. 

The  prototype  system  also  employs  a keyboard  for  data  entry  and  an  alphanumeric  display 
for  information  verification  and  data  interchange. 


Figure  2.  Portable  Microprocessor-Based  Plotting  System 
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Figure  3,  Microprocessor- Controlled  Graphics  Plotter- Systems  Block  Diagram 
D.4  Algorithm 

The  following  discussion  traces  the  algorithm  development  in  solving  the  equation  of  the 
ellipse  on  a microprocessor. 

Cartesian  Approach 

Solve  for  x and  y , given  a and  b 

y=  (b2 - b2xV/2  Eq.  D4-1 

a 

where  x ranges  from  -b  to  +b  in  steps  of  0. 01  inches 

alternatively  y = (b  - bx)  V2  (b  + bx)  ^ 2 Eq.  D4-2 

a a 
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Parametric  Approach 


a “ a cos  0 Eq.  D4-3 

y»  b sin  © 

where  9 ranges  from  0 to  2n  in  steps  of  0.01  radians. 

The  cosine  function  ^ was  initially  approximated  by  the  three-term  economized  Chebyshev 

series 

2 4 

cos(x)  = 1 + a2x  + a^x  Eq.  D4-4 

where  a2=*  -0.49670 

a4  a 0. 03705 

The  series  was  rewritten,  replacing  the  decimal  representation  with 


1000 


where 


cos  (x)  = 


K = x • 100 


-I  — \ 

497  - 270  I • _K  • K. 
\ 2 / 82  61 


1000 


Eq.  D4-5 


Eq.  D4-6 


A more  accurate  six-term  series^  currently  being  implemented  is 

cos(x)  = 1+x2 1 a2  + x2  J^a4  + x2(a6  + x2  [ag  + a10  x2])  | 

where  a2  = -0. 4999999963 

a4  = 0. 0416666418 

a6  = -0.0013888397 
a8  = 0.0000247609 

a10  = 00000026°5 

The  number  of  multiplications  was  reduced  by  factoring^and  the  decimal  constants  will  be 
replaced  by  their  reciprocal  integer  counterparts. 
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D.S  Error  Analysis 


The  round-off  error^in  solving  the  equation  of  the  ellipse  is  represented  by  the  function  E. 

| E | < | Ep  I + I Eg  | Eq.  D6-1 

where 

Ep  Is  the  error  due  to  the  finite  machine  representation  of 
real  numbers;  it  includes  the  propagation  error  due  to  the  arithmetic  operations. 
Eg  represents  the  generated  error,  introduced  as  a result  of  imperfect  machine 
arithmetic  operations. 

An  example  of  generated  error  is  the  truncation  caused  by  integer  division.  In  the  micro- 
computer environment,  E will  depend  upon  the  range  of  the  parameters,  the  accuracy  of 
file  function  approximations,  word  size,  scaling,  and  the  fixed  or  floating-point  mathema- 
tical operations.  Although  not  shown,  a statistical  approach  can  be  used  to  estimate  E. 


Propagated  error  may  be  approximated  by  Taylor’s  theorem: 

F(u,v,w,.  . . , t)  - F(u*v*w*  . . . t*)  ~ -f~  (u-u*)  + ££  (v-v*) 

C»U  ffV 

8F  * 9F  * 

+ lw(w“w  > + * * • + -at  ft-*  > Eq.  D5-2 


where 

u#  v,  w, . . . t are  the  true  values  of  the  function  parameters 
* * * ,* 

u , v , w , . . . t are  machine  approximations 


or 


E = AF  **  ~ E 
p <?u  pu 


9J  E 

dy  pv 


&F  -p 

dw  pw 


9F 

* at  Ept 


where 


pu 

S 

pv 

S 

pw 

Pt 


= u - u* 

= ■ V - V* 

= w - w* 
= t - 1* 


Eq.  D5-3 
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For  the  Cartesian  approach,  where 


F(x#  a,  b)  = (b2 


£<]•  D5  —4 


E = 

py 


- e + 


9 9 

b x 


a"y 


• f 


b 

*.  V ♦ \-r~l  v 


a* 


Eq.  D5-5 


If  Epa  and  E ^ are  assumed  to  be  zero,  the  Cartesian  error  maybe  expressed  as: 


Epy  ( $)  = V x 


Eq.  D5-6 


For  the  parametric  approach,  where 


F(a,  B)  **  a cos  0 
F(b,  6 ) = b sin  0 


Eq.  D5-7 
Eq.  D5-8 


E and  E v,  the  parametric  error  terms  are  evaluated  as 

<"•  JrJ 


E„  s:  cos©  E - a sin <3  E 

px  pa  p© 


Eq.  D5-9 


E =r  sin9  E . + b cosd  E „ 
py  Pb  Pd 


Eq.  D5-10 


Where  E and  E , are  assumed  to  be  zero,  the  parametric  error  may  be  expressed  as: 
pa  pb 


E 


Py  (^2£)  EpxX 


Eq.  D5-11 
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The  Cartesian  error,  Eg,  is  evaluated  in  functional  form  as: 


fl  (Egl’  Eg2’  Eg3*  Eg4’  Eg5*  Eg6*  Eg7) 


where 


Jgl 

Sg2 


E 


Jg3 

g4 

r 

"g5 


-g6 


V 


= Error  introduced  by  squaring  a 

* Error  introduced  by  squaring  b 

= Error  introduced  by  squaring  x 

= Error  introduced  by  dividing  b2  by  a2 

b2  9 

= Error  introduced  by  multiplying  -x  by  x* 

a 

b2v2  o 

= Error  introduced  by  subtracting  — — from  b 

a2 

= Error  introduced  by  the  square  root  function 


The  parametric  generated  error  E is  evaluated  in  functional  form  as 

s 


Ee  ■ f2  <V  V 


where 


E , = Error  introduced  by  sin  or  cos  series  approximation 
gl 

E „ = Error  introduced  by  a or  b multiplication 
g2 


Eq.  D5-12 


Eq.  D6-13 


The  Intel  microcomputer  implementation  of  the  Cartesian  formulation  was  made  difficult 
by  the  following  factors: 


a.  The  limited  integer  range  of  0 to  255  for  single-length  words  and  0 to 
65025  for  double-length  words  caused  overflow  in  multiplication,  re- 
quiring scaling. 

b.  Truncation  caused  by  integer  division  required  scaling. 

c.  Truncation  caused  by  the  integer  square  root  algorithm  required  scaling. 
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Extended  arithmetic  or  floating-point  arithmetic  could  have  been  used  to  alleviate  the 
above  difficulty.  This  approach  is  being  pursued. 

In  a similar  manner,  implementation  of  the  parametric  formulation  was  made  difficult  by 
the  requirements  for  an  accurate  trigonometric  approximation  and  by  the  scaling  required 
to  represent  the  individual  factors  and  the  product  of  their  multiplication. 

D.6  Software 

The  PL/M  language  was  utilized  to  implement  the  ellipse  solution^.  The  Intel  cross- 
compiler  was  used  on  a large  scale  Honeywell  6060  computer  to  produce  a loadable 
object  paper  tape.  The  simulator  program  was  run  later  to  verify  the  proper  operation 
of  the  object  program.  Using  a high  level  language  reduced  training  and  program  develop- 
ment time,  although  the  resulting  program  used  more  memory  resources  than  anticipated. 
A flow  chart  of  the  program  code  is  shown  in  Figure  4. 

D.7  Problem  Areas 


When  implemented  in  PL/M  code  the  Cartesian  approach  revealed  the  following  problems: 

a.  A lack  of  supplied  mathematical  Hbrary  routines.  The  original  square- root 
function  did  not  satisfy  the  accuracy  requirements. 

b.  Division  and  multiplication  worked  properly  on  positive  (unsigned)  numbers  only. 

c.  Integer  division  truncated  the  results,  making  accuracy  difficult. 

d.  Single  and  double  precision  word  arithmetic  was  not  sufficient  to  avoid  overflow. 

e.  The  resulting  error  in  the  x,y  coordinates  was  too  large  to  present  visually 
smooth  ellipses. 


The  parametric  approach  solved  some  of  the  above  problems  and  the  re  suiting  graph  was 
smoother  than  the  Cartesian  results.  However,  a new  series  expansion  for  the 
trigonometric  sine  function  had  to  be  derived. 
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Figure  4. 


PL/M  Program  Flowchart  for  Ellipse 


Some  of  the  system  and  hardware  problems  were* 

a.  The  original  design  used  an  8-bit  DAC,  which  was  expanded  to  10  bits  when  it  was 
realized  that  more  precision  was  required, 

b.  The  classic  problem  of  deciding  whether  a problem  was  hardware  or  software 
was  approached  by  using  the  Intellec  development  system  to  step  through  the 
software  as  well  as  by  probing  the  hardware  circuitry. 
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E.  CONCLUSIONS 

Problem  solving  with  microcomputers  is  likely  to  increase  in  the  near  future.  In  some  cases* 
microcomputers  will  replace  minicomputers  because  of  price,  size,  and  low  power  require- 
ments. To  expedite  development  and  reduce  redundancy,  the  following  recommendations  are 

made: 

a.  Use  a development  team  for  microprocessor  applications.  Include  a numerical 
analyst,  programmer,  and  electronic  engineer. 

b.  Encourage  the  development  of  mathematical  libraries,  and  obtain  access  to 
manufacturers’  libraries. 

c.  Encourage  language  standardization. 

d.  Use  host  software  to  check  and  Simula  e codes. 

e.  Anticipate  requirements  for  multiword  and  floating-point  arithmetic  through  a 
thorough  error  analysis  before  algorithm  implementation. 

f.  Debug  hardware  and  software  on  a development  candidate  unit  such  as  an  Intel 
MCS  or  MDS,  or  a Motorola  EXORciser. 

g.  Consider  the  accuracy  of  the  hardware  (DAC's,  A/D»s,  etc)  before  determining 
software  accuracy.  There  is  no  need  for  software  to  be  more  accurate  than  the 
hardware. 

h.  Employ  scaling  techniques  that  are  appropriate  for  the  individual  application. 

F.  IMPLICATIONS  FOR  ARMY  RESEARCH  AND  DEVELOPMENT 


The  acceptance  and  use  of  microcomputers  by  the  military  will  pace  usage  in  the 
commercial  world.  High  reliability  and  qualification  testing  are  proceeding,  and  some 
acceptance  is  assumed  during  1976. 
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Microcomputers  are  viewed  not  only  as  a direct  replacement  for  minicomputers  but,  be- 
cause of  their  small  size  and  lower  power  requirements,  as  new  devices  for  various 
tactical  applications.  The  use  of  minicomputers  in  radar  and  fire  control  systems  is 
already  well  established.  In  the  future,  micros  will  likely  be  found  on  board  tanks 
(automatic  turret  positioning  and  fire  control,  computer-aided  fuel  injection  systems, 
vehicle  communication  and  display);  in  guided  missiles;  in  electronic  surveillance 
systems;  in  navigation  aids;  and  in  field-portable  communication,  process  control,  and 
logistics  systems.  In  any  instance  where  digital  circuitry  is  applicable,  the  use  of  a 
microcomputer  is  suggested,  based  on  lower  design  costs  as  well  as  increased  capacity 
considerations. 

The  cost  factors  in  using  microcomputers  are  weighed  heavily  in  the  man-time  costs 
necessary  to  prepare  software.  In  a recent  article^,  the  adoption  of  standardized 
processor  algorithms  to  solve  problems  in  electronic  warfare  is  recommended.  The 
author  relates  the  high  life  cycle  costs  of  custom-designed  software  as  compared  to 
the  lower  costs  incurred  by  using  proven  building  block  modules.  Such  an  approach 
could  result  in  libraries  of  algorithms  stored  on  the  disks  of  large-scale  computers  with 
support  software  that  would  allow  the  burning  in  of  read-only-memories  with  those 
programs  of  interest.  In  this  style,  microcomputer  software  applications  would  involve 
integrating  proven  modules  and  preparing  only  the  necessary  high-level  executive 
program  and  if  necessary,  problem-dependent  algorithms. 

For  future  Army  application  of  microprocessors,  a study  of  useful  common  algorithms 
and  preparation  of  guidelines  for  language  standardization  may  be  cost-effective  in 
reducing  software  development  and  redundancy  costs. 
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HIGH  SPEED,  QUALITY  COMPUTING  ON  A MINICOMPUTER 


Edouard  J.  Desautels 
Computer  Sciences  Department 
University  of  Wisconsin 
Madison,  Wisconsin  53706 


ABSTRACT.  Factors  which  should  allow  the  current  generation  of  mini- 
computers to  run  large  scale  scientific  computations  cost-effectively 
in  comparison  to  current  large  systems  are  discussed.  The  obstacles 
to  lower-cost  software  conversion  from  large  systems  will  decrease 
due  to  the  advent  of  minicomputers  with  large  physical  memories  and 
large  address  spaces.  Initiation  of  the  development  of  a numerical 
analysis  problem  solving  system  on  a dedicated  minicomputer  is  proposed 
as  a means  of  exploiting  recent  hardware  and  software  advances. 

1,  INTRODUCTION.  The  title  for  this  paper  may  be  interpreted  by 
some  as  a statement  of  the  obvious.  It  is  intended  to  raise  the 
following  questions.  To  what  extent  can  one  today  conveniently  solve 
large  scientific  problems  quickly  and  as  reliably  on  minicomputers  as 
on  large  shared  systems  such  as  the  C DC  6000,  IBM  370/158+  and  UNIVAC 
1110  series  computers?  What  are  the  obstacles  in  reaping  the  full 
potential  of  minicomputers  in  the  mathematical  software  area?  This 
paper  attempts  to  address  these  and  related  questions. 

As  a preliminary,  let  us  agree  that  minicomputers  for  our  purposes 
are  general-purpose  computer  systems  deemed  sufficiently  inexpensive 
to  avoid  the  need  for  concurrent  sharing  via  a multiprogramming 
operating  system.  Thus  a minicomputer  would  either  be  dedicated  to 
a single  job  or  it  would  run  a simple  batch  system,  or  it  would  be 
turned  over  to  single  users  for  hands-on  use,  etc.  Using  this  defini- 
tion, what  might  be  considered  a minicomputer  at  one  site  (e.g.  a 
$100,000  PDP-11/45)  might  be  used  as  a shared  central  facility  at  another 
site. 


As  we  proceed  with  our  discussion,  the  characteristics  of  mini- 
computers will  also  be  assumed  to  change  with  time.  We  have  to  be 
thinking  of  the  potential  uses  of  minicomputers  as  they  may  be  3 to 
5 years  from  now,  and  what  efforts  have  to  be  undertaken  now  in  order 
to  be  able  to  fully  exploit  them  as  the  new  generation  of  minicomputers 
becomes  available. 
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2.  HIGH  SPEED.  The  measures  of  speed  we  have  in  mind  are  the  raw 
speed  usually  measured  in  millions  of  instructions  executed  per 
second  (MIPS),  sometimes  refined  as  a Gibson  mix  index  (the  sum  of 
the  products  of  instruction  speeds  and  instruction  frequencies  for 
an  assumed  characteristic  instruction  mix).  We  are  also  concerned 
with  the  perceived  speeds  as  measured  by  turnaround  times  under 
actual  operating  conditions. 

Considering  this  second  aspect  first,  many  times  we  have  heard 
undocumented  allegations  as  to  the  overhead  associated  with  large 
operating  systems.  Schneck  in  [9]  discusses  the  factors  which  can 
lead  to  a multiprogramming  system  consuming  an  excessive  fraction  of  a 
system's  resources.  He  advocates  monoprogramming  as  a method  of 
achieving  high  performance  which  yields  advantages  in  turnaround  time, 
efficiency  and  equipment  configuration. 

Returning  to  the  raw  hardware  speed,  in  spite  of  the  small  word 
size  of  most  minicomputers  (e.g.  16  bits),  single-precision  floating 
point  hardware,  for  32  bit  operands,  is  now  available  on  many  systems 
for  $5,000  - $10,000.  Many  minicomputers  have  memory  cycle  times 
of  one  microsecond  or  less.  Thus  when  dealing  with  memory  reference 
instructions,  a rate  of  0.5  MIPS  or  better  is  attained.  Since  some 
minicomputers  are  equipped  with  cache  memories  (a  user-transparent 
high-speed  program  and  data  buffer),  rates  of  1 to  2.5  MIPS  are 
achievable. 

In  a mathematical  problem  solving  situation,  one  may  wish  to 
distinguish  the  speeds  for  two  distinct  phases  of  problem-solving.  The 
first  phase  involves  experimentation,  program  development  and  trial  and 
error  until  a suitable  approach  is  found.  Interractive  computing  is 
the  natural  mode  of  computing  in  this  phase.  When  a suitable  approach 
is  found,  one  enters  phase  two,  the  production  phase  (sometimes 
called  number-crunching). 

One  can  sometimes  rather  easily  cost-justify  using  a minicomputer 
for  the  production  phase.  For  instance,  suppose  one  has  access 
to  a shared  facility  charging  $200  per  CPU  hour,  which  typically  pro- 
vides its  users  with  only  one  tenth  of  its  effective  power  (because 
it  multi  programs  10-15  programs  simultaneously  to  maximize  resource 
utilization).  One  can  imagine  a $100,000  minicomptuer  system  with 
a speed  equivalent  to  the  perceived  speed  of  the  large  system  (a  1 
MIPS  minicomputer  vs  a 10  MIPS  large  computer).  The  breakeven  point 
for  hardware  purchase  occurs  after  500  hours.  Since  a minicomputer 
system  entails  other  costs,  supporting  costs  equal  to  the  initial 
capital  investment  bring  the  breakeven  point  to  1,000  hours  (25 
weeks  at  40  hours/week).  Schaefer  [8]  reports  on  the  use  of  a mini- 
computer (a  Datacraft  6024/4)  for  computations  in  theoretical  chemistry 
as  a realistic  alternative  to  machines  such  as  the  CDC  7600  or  IBM 
360/195.  He  reports  that  the  cost  of  operating  and  capitalizing  a 
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minicomputer  system  can  be  significantly  lower  than  the  cost  of 
operating  a very  large  system  which  has  already  been  amortized,  on 
the  basis  of  computations  per  dollar.  He  concludes  that  as  a 
theoretical  chemist  he  was  able  to  get  three  times  more  computing 
per  dollar  using  a 1973  vintage  minicomputer,  and  that  the  technology 
forecasts  point  to  the  price-performance  bias  towards  minicomputers 
increasing. 

Justification  for  a dedicated  minicomputer  used  for  the  inter- 
active phase  may  be  more  difficult.  If  one  amortizes  the  equipment 
over  4 years,  then  the  $25,000  per  year  may  be  equivalent  to  the  cost 
of  a scientific  programmer  with  his  overhead.  On  this  basis,  the 
justification  may  not  be  so  difficult,  especially  if  the  system  is 
also  used  for  the  production  phase,  as  would  be  expected.  The  inter- 
active phase  is  labor  intensive,  and  any  convenient  interactive 
capability  suffices,  provided  it  is  somehow  linked  to  and  software 
compatible  with  the  production  system. 

3.  QUALITY.  Quality  mathematical  software  is  being  produced  by 
projects  such  as  the  the  National  Activity  to  Text  Software  (NATS)  [1], 
and  organizations  such  as  the  International  Mathematical  and 
Statistical  Library  (IMSL)  [3],  in  addition  to  efforts  put  forth  by 
computer  equipment  manufacturers.  As  might  be  expected,  these 
products  run  on  large  computers  (e.g.  CDC  6000/7000  series,  UNIVAC 
1100  series,  IBM  370/360  series,  etc.). 

Assuming  one  has  access  to  a high  quality  library  of  mathematical 
software,  it  is  more  likely  than  not  written  in  Fortran,  and  one 
has  the  initial  problem  of  selecting  the  appropriate  subroutines, 
then  mastering  the  calling  sequences  and  providing  for  the  data 
handling  requirements.  This  work  can  be  simplified  by  providing  a 
framework  or  a coherent  working  environment  for  the  user. 

In  attempts  to  do  so,  a number  of  experimental  systems  were 
developed  in  the  mid  and  late  1960's,  as  described  in  the  Klerer  and 
Reinfelds  book  [4].  One  in  particular  focussed  on  providing  the  user 
with  a means  of  having  a natural  notation  for  problem  definition  (e.g. 
mathematical  notation),  providing  for  selection  of  appropriate 
algorithms  (e.g.  using  polyalgorithms),  and  using  a natural  representa- 
tion of  results  (e.g.  graphical).  This  was  the  Numerical  Analysis 
Problem  Solving  System  (NAPSS).  The  paper  by  Rice  [7]  is  a 
retrospective  view  of  the  problems  and  prospects  of  NAPSS-like  systems. 
Co-existing  with  the  multi  programmed  operating  system  of  a large 
scale  machine  was  a non-trivial  difficulty  in  the  implementation  of 
NAPSS. 

NAPSS-like  systems  do  not  seem  to  be  available  for  the  current 
generation  of  minicomputers.  This  may  in  part  be  due  to  the  fact 
that  it  was  difficult  to  fit  NAPSS  into  a system  as  large  as  a 
CDC6500.  What  efforts  might  be  undertaken  at  this  time? 
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4.  APPROACHES.  One  can  imagine  using  a minicomputer  as  a program 
generator  (PG).  One  would  present  to  the  minicomputer  a description 
of  a problem  in  a suitable  problem-description-language  (PDL) 
which  presumably  has  the  appealing  aspects  of  NAPSS-like  languages. 

The  PDL  interpreter  checks  the  description  for  consistency  and  a 
limited  notion  of  "correctness".  After  a dialogue  with  the 
problem  originator,  it  generates  a series  of  calls  on  a mathematical 
software  library,  and  transmits  this  collection  to  a large  system, 
as  a remotely- submitted  batch  job. 

The  program  generator  approach  is  attractive,  but  it  probably  is 
the  least  effective  use  of  a dedicated  minicomputer  system.  It  is 
an  attractive  application  for  a minicomputer-based  timesharing 
system,  but  it  may  be  equally  cost-effective  on  a large-scale  system. 

The  second  approach  we  will  discuss  assumes  that,  if  it  is  almost 
cost-effective  to  perform  both  phases  of  problem  solving  on  a mini- 
computer today,  it  will  be  more  so  within  three  to  five  years.  The 
second  approach  involves  dedicating  a minicomputer  to  the  support  of 
a NAPSS-like  system.  Instead  of  having  to  co-exist  with  a general 
purpose  multi  program  operating  system,  we  can  assume  the  operating 
system  will  be  designed  to  meet  the  needs  of  its  one  and  only  user. 

Thus  it  does  not  suffer  the  depletion  of  resources  which  seems  to 
characterize  multi  program  resource  sharing  systems,  it  does  not  have 
to  contend  with  protection  problems  either  for  security  or  as  protection 
against  unreliable  concurrent  users.  Nor  does  it  need  expend  much  time 
in  detailed  cost  accounting  for  resource  utilization. 

Of  course  we  realize  that  some  protection  services  are  useful  as 
debugging  and  program  checkout  tools,  and  that  some  accounting 
information  can  be  used  to  identify  performance  bottlenecks.  However, 
instead  of  supporting  services  for  the  primary  benefit  of  the  sytem 
(e.g.  protection,  accounting),  we  would  prefer  supporting  services 
of  direct  benefit  to  the  user  (e.g.  debugging,  performance  measurements). 

5.  CURRENT  LIMITATIONS.  With  a few  exceptions,  most  current  mini- 
computers are  restricted  to  directly  addressing  64KB  (8  bit  bytes). 

In  a mathematical  software  context,  this  would  provide  for  a maximum 
of  16K  floating  point  32  bit  words,  or  a square  matrix  of  126  by  126. 

In  most  systems  this  would  exhaust  all  of  memory.  In  some,  another 
32KW  (16  bits)  would  be  available  for  program  storage.  In  a few  systems, 
one  could  have  perhaps  1 MB  of  physical  memory,  while  still  being 
restricted  to  the  small  address  space  described  above.  The  report  by 
Poppendieck  and  Desautels  [6]  describes  the  range  of  memories  and 
restrictions  available  on  current  minicomputers,  and  their  implications. 
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As  mentioned  previously,  floating  point  arithmetic  of  suitable 
speed  and  precision  is  now  available  on  many  minicomputers.  The 
availability  of  sufficient  mass  storage  (e.g.  disk)  used  to  be  a 
problem  with  minicomputers.  Fortunately  relatively  inexpensive 
drives  (80  MB)  are  available  for  under  $20,000. 

One  additional  consequence  of  operating  a dedicated  system  is  the 
possibility  of  exploiting  the  microprogramming  option  which  is 
available  on  some  minicomputers.  It  is  very  difficult  to  support 
user  microprogramming  on  a shared  system,  and  it  is  simpler  on  a 
dedicated  system.  In  some  cases  this  can  yield  a performance 
increase  of  a factor  of  2-5  or  better. 

6.  A POSSIBLE  APPROACH.  The  mathematical  software  written  for  large 
computers  might  be  adapted  for  use  on  the  current  generation  of 
minicomputers.  Much  of  this  software  is  written  in  Fortran,  and 
adaption  might  appear  to  be  straightforward.  However  since  most  current 
minicomputers  have  difficulty  handling  even  16K  floating  point  operands, 
much  of  the  code  would  have  to  be  rewritten  to  implement  in  software 
the  virtual  memory  hardware  support  for  large  direct  addressing  found 
on  some  large  computers.  Even  after  an  expensive  adaption  process, 
performance  would  leave  much  to  be  desired. 

Recognizing  that  some  minicomputers  now  support  direct  addressing 
spaces  comparable  to  current  large  computers,  and  assuming  that 
technological  forecasts  indicating  the  cost  of  logic  and  memory  decreasing 
by  a factor  of  two  every  two  to  three  years,  we  would  propose  to  initiate 
development  of  a NAPSS-like  system  on  a dedicated  minicomputer  with 
a direct  addressing  capability  of  at  least  1 MB  (20  bits).  This  is 
equivalent  to  a floating  point  word  capacity  of  256  KW  (32  bit  words) 
which  exceeds  the  direct  addressing  capability  of  large  systems  such 
as  the  CDC  6000  series  and  UNIVAC  1100  series  (restricted  to  65KW 
each).  By  the  time  efforts  to  convert  large  machine  code  into 
16-bit  minicomputers  might  begin  to  bear  fruit,  minicomputer  technology 
is  likely  to  have  advanced  to  the  point  where  large  memories  and 
large  direct  addresses  would  have  invalidated  most  of  the  conversion 
effort.  The  recent  interview  on  the  design  of  the  PDP-11  is 
illuminating  in  this  respect  [10]. 

One  can  argue  that  the  benefits  of  improved  technology  apply  to 
large  and  small  systems  equally,  yet  it  appears  to  be  the  case  that 
the  smaller  systems  increase  in  capability  while  decreasing  in  price 
much  more  rapidly  than  the  large  systems. 

A very  attractive  development  approach  would  involve  beginning 
software  development  on  a minicomputer  which  has  the  appearance 
from  the  programming  viewpoint  of  having  a 32  bit  word.  The  Interdata 
7/32  has  the  logical  characteristics  of  a 32  bit-word  computer,  but  it 
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provides  this  at  a low  cost  by  using  16  bit  internal  data  paths. 

Having  developed  software  on  this  "slow"  system,  one  can  then  upgrade 
into  a faster  system,  the  Interdata  8/32,  which  is  upward  compatible 
with  the  7/32.  The  8/32  provides  32  bit  data  paths  and  other 
enhancements  such  as  an  instruction  cache,  so  that  its  performance 
is  claimed  to  equal  that  of  an  IBM  370/158. 

A computer  such  as  the  8/32  is  attractive  in  that  it  also 
supports  a writeabl e-control  store,  with  which  one  can  write  micro- 
code tuned  to  support  the  current  application.  This  can  provide 
performance  increases  and  opportunities  for  monitoring  arithmetic 
errors. 

Further  performance  increases  can  be  obtained  through  the  addition 
of  outboard  arithmetic  units  such  as  the  one  manufactured  by  Floating 
Point  Systems  [2],  In  principle  it  is  capable  of  a maximum  of  12 
million  floating  point  instructions  per  second,  on  38  bit  operands. 

The  problem  of  course  is  how  to  keep  it  busy  [5], 

7.  PRACTICAL  CONSIDERATIONS.  Developing  software  is  an  expensive 
activity,  and  it  is  not  likely  to  decrease  in  cost.  The  prospects  for 
transporting  software  ("software  portability")  with  little  effort  are 
not  too  promising  for  mathematical  software,  because  of  the  consequences 
of  minute  differences  in  arithmetic,  as  well  as  the  usual  difficulties. 

Since  the  cost  of  main  storage  continues  to  decrease,  it  would 
seem  foolish  to  expend  much  software  conversion  effort  to  adaptation  of 
quality  mathematical  software  developed  for  large  machines  so  that 
it  can  run  on  minicomputers  with  minute  direct  addressing  capabilities. 

8.  CONCLUSIONS  AND  RECOMMENDATIONS.  Current  minicomputers  are  in 
some  instances  as  cost-effective  for  scientific  computations  as 
large  scale  systems.  Within  the  next  few  years,  minicomputers  are 
likely  to  be  much  more  cost-effective,  provided  one  funds  a way  to 
minimize  the  cost  of  adapting  quality  mathematical  software  which  has 
been  developed  for  large  scale  systems. 

Such  adaptations  should  be  performed  for  the  newer  generation  of 
minicomputers  with  larger  direct  addressing  capabilities  (1  MB  or 
greater),  so  that  conversion  costs  remain  reasonable. 
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ABSTRACT . 


Most  present  day  real-time  data  systems  at  test  ranges  have  been  implemented 
using  large  scale  computers  such  as  the  UNIVAC  1108’ s at  White  Sands  Missile 
Range.  In  selecting  a new  real-time  system  for  Yuma  Proving  Ground  (YPG) , the 
US  Army  Test  and  Evaluation  Command  is  applying  a new  approach  - Think  Small! 

With  the  increasing  sophistication  and  capability  of  minicomputers  rapidly 
approaching  those  of  large  scale  computers,  it  is  possible,  by  separating  the 
various  processing  elements  used  in  a real-time  system  and  substituting  inter- 
connected minicomputers,  to  do  the  same  job  for  much  less  cost.  This  is  the 
Distributed  Computing  System  approach. 

Several  distributed  computer  configurations  now  in  existence  are  examined  in 
this  presentation  as  well  as  the  YPG  proposed  system.  Included  are  the  Carnegie- 
Mellon  Multi-Processor  (C.mmp)  System,  the  Bell  Labs  Distributed  Network  and 
the  University  of  California  at  Irvine  (UCI)  System. 


INTRODUCTION. 

The  paper  I am  to  deliver  covers  in  part  a subject  on  which  one  of  the 
acknowledged  experts  is  present.  However  it  is  not  a treatise  on  the  subject 
but  an  information  brief  concerning  Yuma  Proving  Ground  and  how  Yuma 
intends  to  implement  a distributed  computer  network.  This  paper,  originally 
a briefing  prepared  for  the  DoD  Research  & Engineering  Directorate  (DDRE) , 
contrasts  some  of  the  known  current  distributed  computer  systems  with  the  type 
which  Yuma  intends  to  install.  We  tried  to  show  the  practicality,  desirability, 
and  feasibility  of  a ring  type  distributed  network  for  Yuma  without  getting  too 
bogged  down  in  technical  details,  since  those  briefed  were  management  oriented 
rather  than  ADPE  technically  oriented.  Nothwithstanding,  these  DDRE  managers 
are  at  a policy  making  level  influencing  the  entire  scope  of  ADPE  within  the 
DoD,  Prior  to  this  briefing  their  ADPE  orientation  was  towards  large  stand 
alone  computers,  which  is  diametrically  opposed  to  the  ring  concept  planned 
for  YPG.  This  is  the  briefing  paper  substantially  as  it  was  presented.  The 
purpose  was  to  sell  a concept  and  it  succeeded  in  that  objective. 
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At  TECOM  we  have  a diversity  of  weapon  systems  to  be  tested.  Formost  among 
these  are  the  Army's  big  five  developments. 

*Mechanized  Infantry  Combat  Vehicle  (MICV) 

*Advanced  Attack  Helicopter  (AAH) 

*Utility  Tactical  Transport  Aircraft  System  (UTTS) 

*Surface-to-Air  Missile  Development  (SAM-D) 

*XMI  Tank 

Both  instrumentation  and  ADPE  must  be  able  to  react  to  the  testing  requirements 
of  these  items  as  well  as  all  other  Army  materiel  in  a timely  and  economical 
manner . 

At  YPG  for  example,  these  requirements  are  prompted  by  testing  application 
ranging  from  ground  vehicles,  such  as  MICV  & the  XM1  Battle  Tank,  to  Artillery 
such  as  the  XM204  Howitzer;  to  Aircraft-Armament;  and  most  recently  the  Global 
Positioning  System,  A Tri-Service  Responsibility  of  the  Air  Force. 

Figure  1 depicts  the  Global  Positioning  System  (GPS)  concept.  When  complete, 
GPS  will  provide  time  and  positioning  data  for  any  receiving  unit  at  any  location 
on  earth.  The  initial  Yuma  testing  will  constist  of:  (1)  A simulated  or  inverted 

range,  & (2)  Testing  with  up  to  nine  satellites  which  will  be  within  the  reception 
area  of  YPG  approximately  four  hours  per  day. 

During  both  the  Aircraft  Armament  tests  and  the  Global  Positioning  System 
tests,  large  amounts  of  data  will  be  collected  within  short  time  frames.  It 
is  essential  to  do  as  much  processing  in  real  time  as  possible,  in  order  to 
speed  the  analysis  of  the  data  collected  and  to  prepare  reports  in  the  shortest 
possible  time  frame . 

Recognizing  that  a large  increase  in  ADPE  workload  would  result  from  these 
testing  requirements,  TECOM  initiated  action  to  review  all  ADPE  at  YPG.  As  a 
result  several  alternatives  were  identified  which  were  then  studied,  in  full 
recognition  of  the  following  dynamic  technological  thrusts  which  impact  ADP 
today: 

1.  Hardware  costs  are  decreasing  by  a factor  of  100  each  ten  years,  with 
every  indication  of  continuation  for  the  next  ten  years. 

2.  Software  costs  are  decreasing  by  a factor  of  10  each  ten  years,  with 
every  indication  of  continuation  for  the  next  ten  years. 

3.  Data  communication  costs  are  decreasing  by  a factor  of  10  each  ten 
years,  with  reasonable  expectations  of  continuation  for  the  next  ten  years. 

4.  Personnel  costs  are  essentially  stable,  after  accounting  for  inflation. 
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The  consequences  are  that  overall  - personnel  costs  are  looming  larger  relative 
to  all  other  costs,  with  hardware  diminishing  as  a cost  factor. 

But  most  present-day  real  time  data  systems  have  been  implemented  using 
large  scale  computer  systems  such  as  at  White  Sands  Missile  Range  where  large 
UNIVAC  llOS’s  are  used  in  the  real  time  tracking  of  missiles.  These  systems 
effectively  control  input,  output  and  processing  of  all  data  for  real  time 
systems,  (Fig  2)  As  you  are  aware,  the  cost  of  these  large  scale  computers 
is  always  high,  usually  several  millions  of  dollars  or  more. 

On  the  other  hand  the  ever  increasing  sophistication  and  capabilities  of 
minicomputers,  are  rapidly  approaching  those  of  large  scale  computers.  This 
is  accomplished  by  separating  the  various  processing  elements  used  in  a real 
time  system  and  substituting  interconnected  minicomputers  for  each.  The  result- 
ing  multi-processor  system,  using  a number  of  minicomputers,  can  do  the  same 
real  time  job  as  a large  scale  system  and  for  much  less  cost. 

As  a result,  the  thrust  of  our  current  thinking  is  more  in  terms  of  small 
minicomputers.  (Fig  3)  For  this  reason  new  practical  and  cost  effective 
solutions  for  meeting  the  future  ADPE  requirements  of  YPG  were  considered  along 
with  the  traditional  approach. 

With  the  most  recent  availability  of  low-cost  minicomputers  a computer  trend 
toward  localized  computing  at  the  site  of  the  user  is  developing  in  industry  and 
in  Government  as  well.  For  example,  here  in  TECOM  the  computer  has  become  an 
integral  part  of  instrumentation,  test  chambers,  and  data  acquisition  and  control 
systems.  This  requires  that  at  least  part  of  the  computing  facility  be  at  the 
application  site.  The  minicomputer  has  been  ideally  suited  to  such  real-time 
applications.  This  leads  us  to  a concept  of  integrating  a number  of  mini- 
computers into  a computing  system,  (Fig  4) 

In  effect  this  is  an  information  utility  made  up  of  a number  of  minis 
rather  than  one  or  two  maxi-computers.  This  concept  is  known  as  a distributed 
computing  system.  The  goal  of  distributed  computing  is  an  integrated  hardware 
system  which  provides  reliable  service  at  low  cost. 

There  are  several  different  kinds  of  distributed  mini-computer  networks 
in  existence  today  - all  of  which  are  in  various  stages  of  development.  Rep- 
resentative of  these  are: 

*The  Carnegie-Mellon  Multi-Mini  Processor 

*The  Bell  Labs  Spider  Network 

*The  University  of  California  at  Irvine  Ring  Network 

Figure  5 is  a schematic  of  the  ADPE  contained  in  the  Carnegie-Mellon  Multi- 
Mini-Processor  or  C.mmp.  This  system  will  contain  up  to  sixteen  mini-processors, 
five  of  which  are  shown.  These  are  connected  through  a switch  to  memory  boxes. 
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The  switch  allows  any  processor  to  access  any  of  the  memory  boxes. 

The  processors  are  not  permanently  attached  to  a memory  box,  rather  each 
time  a processor  wishes  to  access  a particular  memory  a connection  is  established 
through  the  switch  for  that  access,  sixteen  separate  processor-memory  connections 
will  be  possible  simultaneously. 

Peripheral  devices  are  connected  to  buses  associated  with  each  processor 
and  gain  access  through  these  buses  to  the  shared  memory.  Each  processor  can 
intercept  each  of  the  other  processors  at  several  priority  levels  and  can  start 
and  stop  other  processors.  This  enables  one  processor  to  task  another  to 
perform  an  operation  for  a program  running  on  the  first  processor.  Thus  user 
programs  are  not  restricted  to  execute  on  any  particular  processor. 

Figure  6 illustrates  the  techniques  used  to  keep  the  highest  priority  jobs 
in  processing.  If  a new  user  program  enters  the  queue  and  is  a higher  priority 
than  the  job  in  "A"  then  the  priority  will  be  compared  with  the  priority  of  the  job 
in  processor  "10".  If  the  new  job  Is  a higher  priority  it  will  take  the  place  of 
the  job  in  processor  "10”  and  that  job  will  be  returned  to  the  waiting  queue  as 
the  highest  priority  in  the  queue. 

An  algorithm  stored  in  the  memory  controls  the  matching  of  priority  jobs  and 
processors,  and  insures  that  those  jobs  with  the  highest  priority  are  processed 
first . 

There  are  several  benefits  to  the  C.mmp  - (1)  by  having  multiple  processor 
units,  the  failure  of  any  one  will  not  crash  the  total  system.  Removal  of  one 
processor  from  the  system  will  affect  the  system  so  little  that  it  may  hardly  be 
noticed. 

(2)  Minicomputers  are  produced  in  large  quantities  at  low  cost,  and  as  it 
turns  out  the  C.mmp  system  costs  less  than  one  half  of  what  a single  machine  of 
similar  power  would  cost. 

(3)  Interconnected  minis  allow  user  organizations  to  start  at  a level 
using  only  the  number  of  minis  needed  and,  as  requirements  and  usage  grows, 
expansion  can  be  achieved  by  adding  processors  as  required.  This  technique  is 
much  more  cost  effective  than  replacing  a large  processor. 

(4)  In  the  C.mmp  configuration,  if  required  all  the  processors  may 
cooperate  to  solve  a single  problem  or  each  processor  may  be  dedicated  to  a 
different  user  or  any  combination  in  between. 

The  C.mmp  is  not  a geographically  distributed  network.  All  processors  are 
in  the  same  room.  It  is  however,  an  alternative  to  a large  maxi-computer . 

Figure  7 shows  a Bell  Labs  type  Distributed  Network.  This  network 
has  a central  control  known  as  "Spider",  All  communication  must  first  go  to  Spider 
for  messages  to  be  routed  to  the  appropriate  network  member.  This  creates  a 
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central  decision  point  facilitating  workload  distribution  and  resource  sharing* 
Depicted  are  different  configuration  sets  consisting  of  three  mini's  each*  (The  Bell 
Lab  system  actually  links  eleven  mini-computers  of  five  different  types.)  Each 
machine  connects  to  Spider  through  a terminal  interface  unit  (TIU) . The  basic 
idea  for  this  system  involves  the  transmission  of  data  between  minicomputer 
terminals  in  packets,  or  bursts,  rather  than  a uniform  stream.  Each  mini- 
computer terminal  is  associated  with  a buffer*  Each  buffer  is  large  enough  to 
hold  at  least  one  data  packet*  This  method  provides  the  time  periods  needed  for 
making  effective  routine  decisions  by  Spider  which  operates  as  a data  switch. 

Spider  is  the  central  component  of  the  system  and  switches  data  between  the 
various  minicomputers*  Each  of  the  minicomputer  terminals  sends  out  data  bursts 
in  response  to  encoded  control  signals  generated  by  a Spider  controlled  interface 
technique.  Data  transmitted  from  a mini  terminal  is  picked  out  by  a multiplixer 
at  intervals  determined  by  the  encoded  interface  control  signals* 

In  doing  this,  the  multiplixer  assembles  the  data  into  the  proper  packet 
format  prior  to  sending  it  to  Spider  on  one  of  the  network  transmission  lines. 
Packets  of  data  are  transmitted  by  placing  messages  into  an  open  time  slot  on  a 
conveyor  belt-like  channel.  Input  messages  are  received  similarly* 

The  advantages  of  the  Spider  Network  are  the  same  as  for  the  C.mmp* 

However,  this  system  is  distributed  over  a larger  area,  being  specifically 
designed  to  support  different  laboratories  conducting  experiments  and  research. 

One  potential  shortcoming  of  the  system  is  the  switch  communication  computer, 
the  Spider,  thru  which  all  traffic  flows.  Failure  of  this  component  causes  a 
failure  in  the  total  network. 

Figure  8 depicts  the  University  of  California  (UCI)  type  system,  which  has  a 
number  of  minis  connected  to  a single  transmission  line  in  a ring  configuration. 

Each  mini  interfaces  with  the  line  via  a device  called  a ring  interface,  or  RI, 

Each  of  the  RI  units  is  programmed  to  impart  outgoing  and  incoming  computer 
information. 

Each  RI  recognizes  information  on  the  line  addressed  to  its  associated  mini 
and  passes  it  on  to  that  mini  while  rejecting  information  not  so  addressed.  The 
transmission  line  is  designed  so  that  all  information  flows  in  one  direction. 

Since  all  RITs  continually  reject  or  accept  information  from  the  line,  they  can 
spot  available  time  slots  in  which  outgoing  information  can  be  inserted  without 
interfering  with  other  data. 

Thus,  control  of  data  flow  and  destination  is  time  distributed  around  the 
ring.  If  one  or  more  of  the  minis  or  RI's  breaks  down,  the  rest  of  the  system 
continues  to  function. 
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The  RI's  are  equipped  with  self-monitoring  circuits  that  detect  any 
malfunctioning  of  normal  operations.  Since  RI  or  mini  failure  could  cause 
road  blocking  of  the  transmission  line,  the  self-monitoring  circuit  acts  as  a 
circuit  breaker  to  take  the  RI  and  its  mini  off  the  line  if  appropriate. 

The  only  store-and  forward  message  switching  technique  required  in  the 
system  is  that  between  each  mini  and  its  RI.  Besides  greatly  reducing  trans- 
mission time,  this  simplifies  the  RI  design,  thus  the  RI's  can  be  built  for 
much  less  than  any  other  type  of  communications  processor  - less  than  five 
hundred  dollars.  This  further  enhances  cost  effectiveness  when  the  ring  is 
expanded  by  hooking  up  an  additional  RI  to  the  line  and  connecting  it  to  a mini. 

Each  RI  contains  a multi  address  code.  When  a message  from  another  RI 
passes  it  on  the  line,  the  destination  code  is  compared  to  the  stored  addresses. 
If  a match  results,  the  message  is  accepted  and  the  information  transferred  to 
the  minicomputer.  If  the  minicomputer  cannot  accept  the  message  because  it  is 
occupied,  the  message  is  transmitted  to  the  next  appropriate  computer  in  the 
ring. 


Figure  9 depicts  the  completed  UCI  ring  network  including  its  associated 
peripherals,  (i.e.,  printers,  plotters,  & disks).  Low  cost  is  achieved  here  in 
several  ways.  Each  of  the  component  computers  of  the  network  is  relatively  small 
and  inexpensive.  The  system  software  is  a modest  programming  effort,  existing 
other  software  can  be  integrated  readily,  and  finally  standardized  interfaces 
to  the  communication  ring  are  available  at  low  cost  as  previously  indicated. 

The  same  benefits  apply  here  as  in  the  previous  networks.  However  it  is 
even  more  reliable  because  no  central  control  computer  is  required  as  in  the 
Bell  Lab's  system.  Also  the  ring  concept  has  the  advantage  of  allowing  for 
greater  geographical  dispersion  than  the  C.mmp. 

Having  examined  3 examples  of  minicomputer  distributed  networks,  let  us 
turn  now  to  TECOM. 

Profiting  from  the  R&D  efforts  of  others,  TECOM  at  Yuma  Proving  Ground 
is  moving  into  this  new  technology  with  the  expansion  of  the  real-time  data 
acquisition  network  on  the  Aircraft  Armament  Range.  This  will  be  the  first 
non-laboratory  application  of  the  ring  concept.  The  ADPE  (some  now  in  operation) 
to  be  included  in  this  distributed  network  will  range  from  programmable 
calculators,  to  minicomputers,  to  a large-scale  central  real-time  and  batch 
processing  system. 

Shown  schematically  in  Figure  10  is  the  existing  instrumentation/ADPE  for 
the  Aircraft  Armament  Range  for  Yuma's  Cibola  Range. 

Two  computers  (an  EMR  6130  and  an  IBM  7044/7094  Direct  Coupled  System) 
comprise  the  main  computer  site.  A real-time  cinethedolite  data  system  (CINE), 
located  at  three  separate  sites  tracks  a target  and  transmits  position  data  to 
the  main  computer  site  in  real-time.  A precision  aircraft  tracking  system (PATS) , 
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which  is  a Laser  Tracker,  also  tracks  and  transmits  position  data  to  the 
central  site.  The  data  flow  illustrated  occurs  five  times/sec.  Total 
computation  time  for  all  processors  in  the  central  computer  takes  under  one 
hundred  eighty  milliseconds. 

Figure  11  is  a schematic  of  how  the  range  will  look  with  the  ADPE 
instrumentation  system  now  in  procurement.  The  two  additional  laser  trackers 
(PATS  2 & 3)  each  have  a minicomputer  for  control,  data  logging,  and  formatting. 

The  position  locating  system  at  site  7 (PLS)  also  contains  a minicomputer.  All 
data  from  these  data  collection  systems  will  be  transmitted  to  the  main  computer 
site  for  real-time  data  reduction  with  output  then  being  sent  back  to  the  test 
site  for  analysis  and  control  by  the  test  officer.  The  output  displays  (DISP) 
also  contain  minicomputers. 

In  addition  to  the  instrumentation  and  ADPE  currently  in  procurement;  the 
central  real-time  computers  at  YPG  are  also  expected  to  be  replaced.  The  re- 
placement will  take  the  form  as  shown  in  Figure  12  and  represents  the  new 
technology  in  software  and  hardware  which  we  referred  to  earlier  as  a distributed 
computer  network.  The  net  will  incorporate  key  minicomputers  at  the  test  sites 
for  data  reduction  and  processing.  In  this  net  execution  of  programs  both  real- 
time and  batch,  will  automatically  be  performed  on  the  optimum  system  available. 

Benefits  which  are  expected  to  accrue  from  the  YPG  distributed  computing 
network  are:  a higher  level  of  performance;  increased  reliability  and  availability 

because  of  the  redundant  nature  of  the  network;  and  since  the  net  is  fundamentally 
a complex  of  minicomputers  - expandability.  In  the  case  of  YPG,  the  capital 
investment  for  the  basic  mini-ring  network  is  much  less  than  it  would  be  for  a 
new  stand  alone-system.  That  alone  would  make  it  worthwhile.  Additionally,  by 
taking  advantage  of  some  minis  currently  at  YPG  the  capital  cost  of  this  network 
will  be  even  less,  while  operating  expense  for  this  system  will  be  approximately 
seventy-eight  thousand  dollars  per  year  less  than  the  current  systems. 

Since  the  new  equipment  will  cost  three  hundred  thirty  thousand  dollars 
it  will  pay  for  itself  in  a little  over  four  years,  primarily  from  the  savings 
in  maintenance  cost. 

In  conclusion,  a real  challenge  we  face  at  TECOM  is  the  increasing 
sophistication  of  DoD  materiel  and  the  demand  this  places  on  testing  technology. 

The  YPG  distributed  computer  system  is  but  one  example  of  what  we  are  doing  in 
TECOM  to  enhance  our  testing  capability  in  a cost  effective  manner. 
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LINEARIZED  LEAST  SQUARES 

Larry  M.  Sturdivan  arid  John  W.  Jameson 
Biomedical  Laboratory 
Edgewood  Arsenal 

Aberdeen  Proving  Ground,  Maryland  21010 


ABSTRACT.  The  paper  discusses  iterative  methods  for  fitting  linear- 
izable  functions  by  weighted  least  squares.  Starting  values  of  the 
coefficients  are  derived  from  data.  The  method  is  adaptable  for  some  desk 
top  programmable  calculators.  The  effect  of  various  weighting  schemes  on 
the  residual  sum  of  squares  is  discussed.  Two  methods  are  presented  for 
making  the  residuals  in  the  dependent  variable  sum  to  zero. 

1.  INTRODUCTION.  Since  the  early  19th  century  the  method  of 

least  squares  has  been  the  most  widely  used  mathematical  tool  for  determining 
the  values  of  constant  parameters  in  linear  functions  (linear  with  respect  to 
the  parameters  to  be  determined).  The  general,  nonlinear  least  squares 
problem,  by  contrast,  remains  a largely  unsolved  problem.  There  is, 
however,  a class  of  nonlinear  functions,  which  some  authors  call  "intrinsically 
linear",  for  which  good  estimates  of  the  least  square  values  of  the  fitted 
parameters  may  be  easily  and  reliably  obtained. 

2.  STATEMENT  OF  PROBLEM.  Let  us  assume  that  we  have  a dependent 
variable  y which  may  be  expressed  as  an  explicit  function  of  a set  of 
independent  variables  and  constant  parameters  whose  values  are  to  be 
estimated. 

i.e.  y = F (x.,  bj)  ; i = 1 n ; j = 1,  ...  m (1) 

y = dependent  variable 
x j-  set  of  n indep.  variables 
b.=  set  of  m constant  parameters 

J 

The  following  algorithm  may  be  used  to  determine  the  bj  if  F may 
be  made  linear  with  respect  to  the  bj  by  suitable  mathematical  transformation, 
i.e.  if  there  is  some  function  h(y)  such  that 
m 

h(y)  = 2 b.  g.  (x.);  i = 1,  ...,  n (2) 

j=l  J J 1 

and  dh(y)/dy  is  defined 

The  functions  F and  gj  may  involve  known  constants  which  are  not  members 
of  bj;  e.g.  the  fourth  of  the  following  examples  of  intrinsically  linear 
functions: 
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y = In  (bx) 
y = arctan  (bx) 


y = n X,-  1 
i=l  1 


K,  + aexp  (b,*,)  k0,  K,  Known 


Of  course  conventional  least  squares  could  be  used  directly  on  equation  2. 
The  result,  however,  is  the  minimization  of  squared  errors  in  h,  not  y. 

3.  THE  ALGORITHM.  We  designate  the  difference  between  the  actual 
and  predicted  value  of  y for  the  kth  data  point  as: 

Ayk  = - F (bj>  xi|<)  k = ls  •••»  P 


Ahk  = h(yk)  - h(yk  - Ayk)  - h(yk)  - h(F(bJ.,  xik)) 

Expanding  h(y  - Ay)  in  a Taylor  series  with  a remainder  term,  we  get 
h(y  ~ Ay)  = h(y)  - Ayh'(y)  + j.  + 


+ * . + ( -L 


where  h'^'(y)  is_the  q^  derivative  of  h(y)  with  respect  to  y,  evaluated 
at  y,  and  where  y is  in  the  interval  between  y and  y + Ay.  Let  q = 1,  then 

f=My) 

Now  we  want  to  minimize 


2 (Ayk)  = 2 (Ahkr  (3) 

k=l  k k=l 

[h’(yk)]2 

This  is  equivalent  to  weighted  least  squares  on  equation  2 with  weights 

uk2  - [W(yk)]2;  k = 1 p (4) 

Initially  we  do  not  know  the  value  of  the  Ay|<,  much  less  the  position 

of  yk  in  the  interval,  so  we  use  the  data  points  yk  to  approximate  the 

We  may  then  use  the  Ayk  from  the  first  pass  to  calculate  improved 
weights  and  make  a second,  etc.,  continuing  the  iteration  until  stability 
is  reached.  Several  methods  of  obtaining  improved  estimates  of  the  uk  of 

equation  4 have  been  tried.  Some  authors  have  used  regression  line  values 
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for  yk  in  equation  4;  in  effect  using  the  opposite  end  of  the  interval 
from  the  data  point.  This  method  improves  the  fit  (decreases  the  sum 
of  deviations  in  equation  3)  most  of  the  time,  but  if  y.  is  closer  to 
the  data  point  than  the  regression  value  the  fit  thus  obtained  is  actually 
worse  than  the  results  of  the  first  iteration.  Other  methods  tried 
include: 


a) 


b) 


values. 

c) 


The  first  q terms  of  the  Taylor  expansion  about  the  data  values. 
The  first  q terms  of  the  Taylor  expansion  about  the  regression 


u,  2 = ( 


— ! ) ( ! ) 

h'(yk)  h'(yk  + Ayk) 


(geometric  mean  of  data 
and  regression  values) 


d)  uk2  = (Ayk/Ahk)2 


(A1 s from  previous  iteration) 


Methods  a and  b have  the  serious  drawback  of  requiring  the  first  q 
derivatives  of  h,  almost  always  an  unpleasant  task  whatever  the  form 
of  h.  It  is  wasted  labor  in  most  cases,  as  well,  for  method  d usually 
gives  results  as  good  as,  or  better  than  any  other.  In  fact,  intuitively 
one  might  think  that  method  d would  lead  to  convergence  on  "the"  least 
squares  answer.  However,  we  must  remember  that  we  are  determining  m weights 
and  n b's  from  just  m data  points,  an  underdetermined  system.  Thus  as  long 
as  the  weights  are  determined  from  the  data,  as  they  must  be,  one  cannot 
guarantee  convergence  on  the  global  minimum.  Indeed,  we  have  observed 
intermediate  iterations  in  which  the  sum  of  squared  deviations  in  y was 
slightly  smaller  than  the  "converged"  value. 


Unlike  the  linear  least  squares  case,  the  above  algorithm  does  not 
produce  a zero  sum  of  deviations  in  y: 

i .e.  ZAy  = ZuAh  ^ o 
o 

Although,  zu  Ah  = o 

If  one  has  an  overriding  reason  for  desiring  a zero  sum  of  deviations 
in  y,  it  may  be  obtained  by  minimizing 

Zu(Ah)2  = ZAyAh  (5) 


which,  of  course  does  not  minimize  the  sum  of  squared  deviations  in  y. 

In  many  cases  the  increase  in  squared  deviations  is  tolerably  small  with 
minimization  of  equation  5 (i.e.  using  weights  u rather  than  u2)  if  zero 
sum  of  deviations  is  desired.  A second  method  of  obtaining  a zero  sum 
of  deviations  is  to  force  the  fitted  line  to  pass  through  the  weighted 
mean  h*,  x^*.  i.e.  fit  the  model 
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m-1 

h - h*  = £ bi  g,  (xi  - x-*) 
j=l  J J 

i = 1 , . . . , n 

where 

h*  = £uh 
ITT 

and 

x.*  = Eux . 

zu 

2 

using  weights  u . The  summation  running  to  m-1  indicates  the  absence 
of  an  intercept;  i.e.  one  of  the  b's,  say  b^,  for  which  gf  = 1 . 

Both  methods  of  obtaining  EAy  = o work  best  when  the  u's  are  calculated 
by  method  (d)  above. 


4.  AN  APPLICATION.  An  example  of  an  intrinsically  linear  function 
is  the  Logistic  (probability  distribution)  function 

y _ 1 + a/xb] 

The  linearized  form  is 

2 

h(y)  = ln(y/l-y)  = Ina  + b.ln  x = £ b.g.(x) 

1 i=l  1 1 


where  g-j  = Inx 

g2  = i 


b2  = lna 


Also 


1 


1 


u = h'(y)  = y(l-y) 


The  following  table  shows  how  much  improvement  in  the  root  mean  square 
error  was  gained  by  using  the  iterative  algorithm  on  the  Logistic  function 
over  an  unweighted  fit.  As  expected,  the  fit  is  perfect  when  the  data  all 
lie  right  on  the  fitted  line,  whether  the  least  square  fit  is  weighted  or 
not.  As  the  data  get  more  scattered  (i.e.  with  a correlation  coefficient 
significantly  less  than  1.00)  the  reduction  in  root  mean  square  error  is 
marked  (Case  III). 


Case 

Correl . 
Coeff. 

Root  Mean  Sq. 
Unweighted 

Error 

Weighted 

% 

Improvement 

I 

1.00 

0 

0 

0 

II 

.98 

0.0496 

0.0488 

2 

III 

.94 

0.0419 

0.0368 

12 
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NONLINEAR  SPLINE  REGRESSION  ON  MINI-COMPUTERS 
Philip  W.  Smith* 

Appendix 

Stanley  Hrnclr  and  Philip  W,  Smith* 

Department  of  Mathematics 
Texas  A&M  University 
College  Station,  Texas 


1.  Introduction.  In  this  paper  we  will  present  our  experiences  in 
attempting  to  solve  certain  nonlinear  regression  problems  on  a mini-computer. 
More  specifically,  we  used  spline  curves  of  a specified  order  and  treated 
the  knots  as  nonlinear  parameters. 

Section  2 contains  the  relevant  notation  and  theory.  In  seqtion  3 we 
make  some  final  remarks.  At  the  end  is  an  appendix  with  a listing  of  the 
programs  and  an  abbreviated  description  of  their  use  with  examples.  The 
reader  who  is  unfamiliar  With  splines  should  first  glance  through  the  examples 
in  the  appendix. 


2.  Description  of  Numerical  Solution.  Throughout  this  section  and  the 
next;  the  following  notation  will  be  used.  Let  k be  a positive  integer  and 
t be  a knot  vector  satisfying 


t:=a~t1  = . . ,~t^<  t^-pi  • • « < "...  - t - b 


n ' n+1 


"n+k 


where  t..+k  > 1 j * Given  a k and  a knot  vector  £ one  forms  the  normalized 


k-1 


B-splines  N,  , by  [1] 

J , K1 

Nj,kc£,x)  = Ctj+k  " VIV'",tJ+kJ(’“T>+ 

where.  [t  denotes  the  k-th  divided  difference  operator.  A 

spline  of  order  k with  knot  vector  t is  any  function  of  the  form 

n 

s(£,A,t)  = AjN^k(t,l). 


*This  author  supported  by  the  U.S.  Army  Research  Office  under  Grant  Number 
DAHC  04-75-G-0186. 


Note:  (s-v)^  1 = (s-v)k_1  if  s > v and  is  zero  otherwise. 
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Given  data  {(x.,y.)}^  one  can  attempt  to  find  t*  and  A*  so  that 

■L  2 r 2 

min  l [s(t,A,x  ) - y , ] = l [s(t*,A*',x  ) - y ] 

t,A  i=l  * 1 . i=l  1 1 

It  is  in  general  impossible  to  find  t*  and  hence  A*;  however,  given  a 
knot  vector  t and  linear  parameters  A one  can  attempt  to  decrease  the 
error  sum  of  squares  by  some  sort  of  descent  algorithm. 

The  method  of  descent  which  we  use  is  essentially  that  of  deBoor  and 
Rice  [2]. 

Algorithm:  Given  £ and  j * k + 1 

Step  1:  Find  A which  minimizes 

L 2 

l fs(t,A,x  ) - y ] 
i=l  ix. 

Step  2:  With  A fixed,  move  t^  between  t^^  and  ^+1  30  as  to  ^e" 

crease  the  error  sum  of  squares.  Call  this  new  knot  sequence  t. 

Step  3:  If  j < n set  j = j + 1,  otherwise  set  j = k + 1. 

Step  4:  Go  to  1. 

This  algorithm  is  described  in  more  detail  in  the  appendix.  We  remark 
that  Step  1 consists  of  solving  a linear  system  and  that  in  Step  2 we  use 
a quadratic  interpolation  to  obtain  an  approximation  to  the  minimum. 

We  originally  had  a Fortran  version  of  the  above  algorithm  and  "trans- 
cribed" this  program  into  Basic  when  implementing  it  on  an  HP  9830A.  Some 
of  the  characteristics  of  the  HP  9830A  are  detailed  in  the  appendix.  Oper- 
ating time  on  the  HP  9830A  was  quite  long.  A typical  problem  with  20  data 
points  and  k = 4 with  2 interior  knots  might  easily  take  thirty  minutes. 

We  discovered  that  most  of  the  time  was  being  spent  in  evaluating  the 
error  sum  of  squares, 

L 2 
l [s(t,A,x  > - y ] , 

i-1  - x x 

This  term  must  be  evaluated  quite  often  during  Step  2 of  the  algorithm. 
Additional  time  was  being  spent  filling  out  the  matrix  to  solve  for  the 
linear  parameters.  The  two  routines  which  were  being  used  to  evaluate  the 
spline  or  fill  out  the  matrix  had  the  following  form* 


54 


Given  ^ T < t^+^t 

620  N[1 , 1]  = 1 

630  FOR  S = 1 To  K - 1 
640  P[S]  - T[I+S]  - T 
650  M[S]  *=  T - T[ 1+1-S] 

660  N[1,S+1]  * 0 

670  FOR  R = 1 TO  S 

680  Z9  » N[R,S]/(P[R]  + M[S+1-R] 

690  K[R,S+1]  - N[R,S+1J  + P[RJ  * Z9 

700  N[R+1,S+1]  * M [ S+l-R]  * Z9 

710  NEXT  R 

720  NEXT  S 

730  RETURN 

This  computes  simultaneously  the  values  of  all  the  non-zero  normalized  B- 
splines  at  the  point  T,  for  the  matrix  computations. 

880  FOR  R - 1 TO  K 
890  P[R]  = T [ I+R]  - T 
900  M[R]  = T - T [ I-K+R] 

910  D[R,1J  * A[ I-K+R] 

920  NEXT  R 

930  FOR  S = 1 TO  K - 1 
940  FOR  R = S + 1 TO  K 

950  D[R,S+1]  * (M[R]  * D[R,S]  + P[R-S]  * D[R~1,S] )/ (M[r]  + P[R-S]) 

960  NEXT  R 

970  NEXT  S 

980  RETURN 

This  computes  s(t,A,T)  for  the  error  sum  of  squares. 

The  parameter  K is  set  at  the  first  of  the  program  and  determines 
the  order  of  the  spline.  Since  K does  not  change  throughout  the  program 
we  decided  to  rewrite  these  routines  in  such  a way  that  no  loops  were 
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involved.  This  simple  expediant  reduced  the  running  time  by  a factor  of 
two  or  three.  We  list  below  the  corresponding  changes  In  the  k = 2 
(linear  spline)  program  and  the  k •»  4 (cubic  spline)  program.  Given 


fci  <■ 

T < 

t. 

i+1  ’ 

For 

k 

2 

1892 

PI 

n 

T£ I+l]  - T 

1894 

Ml 

- 

T - T[ I] 

1896 

Z9 

= 

1/(P1  + Ml) 

1897 

SI 

a 

PI  * Z9 

1898 

S2 

a 

Ml  * Z9 

This  computes  simultaneously  the  values  of  all  the  nonzero  normalized  B- 
splines  at  the  point  T and  stores  them  in  SI  and  S2. 

2788  PI  = T [ 1+1 ] - T 
2792  M2  = T - T[I] 

2802  Cl  = (M2  * A f I ] + PI  * A[I-1])/(M2  + PI) 

This  computes  s(t,A,T)  for  the  error  sum  of  squares.  For  k = 4 we  have 


1730 

PI 

X [ I+l  j - T 

1740 

P2 

a 

T I T.+2  ] - T 

1745 

P3 

e: 

T [ 1-4-3  3 - T 

1750 

Ml 

T ~ T[ I] 

1760 

M2 

- 

T - T[I-2] 

1770 

Z9 

s=’- 

l/(Pl!Fil) 

1780 

SI 

*= 

PI  * Z9 

1790 

S2 

a 

Ml  * Z9 

1810 

Z9 

- 

SI/ (PI  + M2) 

1820 

SI 

= 

PI  * Z9 

1830 

C2 

M2  * Z9 

1840 

Z9 

e 

S2/(P2  + Ml) 

1850 

S2 

- 

C2  + P2  * Z9 

1860 

S3 

- 

Ml  * 29 

w6\ 

Z9 

=s 

SI/ (PI  + M3) 
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1862 

SI 

B5 

PI  * Z9 

1863 

C2 

as 

M3  * Z9 

1864 

Z9 

S3 

S2/(P2  + M2) 

1865 

S2 

ra 

C2  + P2  * Z9 

1866 

C3 

w 

M2  * Z9 

1867 

Z9 

- 

S3/(P3  + Ml) 

1868 

S3 

cat 

C3  + P3  * Z9 

1869 

S4 

SIT 

Ml  * Z9 

This  computes  simultaneously  the  values  of  all  the  nonzero  normalized  B- 
splines  at  the  point  T and  stores  them  in  SI  through  S4. 

2490  PI  » T[ 1+1 j - T 
2500  P2  = If  1+2]  - T 
2503  P3  « T[I+3]  - T 
2510  M2  » T - T [ 1—2 ] 

2520  M3  - T - T[I~1] 

2530  M4  - T ~ T[I] 

2540  Cl  « (M2  * A [1-2]  + PI  * A[I-3] )/ (M2  + PI) 

2550  C2  - (M3  * A [ 1-1 ] + P2  * A[I-2])/(M3  + P2) 

2560  C3  - (M4  * A[ I]  + P3  * A[I-1])/  (M4  + P3) 

2562  Cl  * (M3  * C2  + PI  * Cl)/ (M3  + PI) 

2564  C2  - (M4  * C3  + P2  * C2)/(M4  + P2) 

2566  Cl  - (M4  * C2  + PI  * C1)/(M4  + PI) 

2570  RETURN 

This  computes  s(t,A,T)  for  the  error  sum  of  squares, 

It  is  easily  seen  that^as  K increases  the  length  of  these  new  routines 
increases  by  a factor  of  K , 

3.  CONCLUDING  REMARKS.  When  writing  programs  on  mini, -computers  one 
has  to  be  extremely  careful  not  to  write  the  programs  in  full  generality, 
at  least  for  this  generation  of  mini-computers,  due  to  their  slow  execution 
time.  For  our  particular  problem  we  found  it  necessary  to  write  a special 
program  for  each  order.  Initially  this  takes  more  time,  but  in  the  long 
run  much  time  is  saved. 
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Concerning  our  specific  problem  we  found  that  most  of  the  computer  time 
was  spent  in  the  spline  evaluation  routine.  This  occurred  quite  often 
during  the  computation  of  the  error  sum  of  squares.  Now,  originally  the 
spline  evaluation  was  accomplished  in  two  nested  do-loops  where  the  outer 
parameter  was  K — 1 where  K is  the  order  of  the  spline.  Since  in  our 
applications  K was  fixed  throughout  the  entire  program,  it  made  sense  to 
rewrite  the  program  for  a fixed  K,  thus  eliminating  the  loops  and  lineariz- 
ing the  execution.  The  details  are  in  Section  2. 
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I,  Man/Machine  Interface 

A.  The  program  presented  in  this  paper  was  written  in  the  BASIC 
computer  language  for  the  Hewlett  Packard  Model  9830A  digital  com- 
puter, The  BASIC  language  is  very  similar  to  FORTRAN, 

1.  In  using  the  HP9830,  the  user  is  provided  with  a man/machine 
interface  capability.  With  this  interface  capability,  the  pro- 
gram pauses  at  various  points  in  its  execution  to  allow  the  user 
to  exercise  different  options  such  as  plotting  the  spline,  plot- 
ting the  data  points,  calculating  the  derivative,  etc, 

B , Cassette  Storage  Capability. 

1.  ~The  HP9830A  has  secondary  storage  capability  in  the  form  of 
cassette  magnetic  tapes.  Programs  as  well  as  data  points  can  be 
maintained  on  these  tapes.  Having  this  peripheral  capability  is 
a tremendous  asset  especially  when  dealing  with  large  programs 
and/or  large  volumes  of  data. 

Currently,  there  are  three  seperate  versions  of  the  Spline 
>iiY;ram  being  maintained  on  three  seperate  files.  The  user  can 
select'  from  any  of  these  depending  on  which  degree  Spline  pro- 
gram he  wishes  to  run  (Linear,  Quadratic,  or  Cubic), 

II . Main  Program.  The  main  routine  is  used  only  for  initialization  of 
input  data  and  knot  placement, 

A.  Input  Data.  The  user  is  provided  the  option  of  inputting  data 
through  the  keyboard  or  having  the  program  read  previously  stored 
data  from  a cassette  tape.  With,  either  option,  the  user  must  pro- 
vide the  program  with  two  file  numbers.  If  the  data  is  being  input 
through  the  keyboard,  the  program  will  build  two  new  files  on  the 
cassette  tape, 

1,.  After  the  program  is  loaded  and  running,  the  program  will 
request  the  user  to  input  the  number  of  data  points.  The  request 
will  be  displayed  on.  the  display  panel  and  the  user  simply  types 
in  the  number  and  then  hits  the  execute  switch, 

2.  The  next  program  request  is  for  the  means  by  which  the  data 
Is  to  be  input — either  keyboard  or  cassette  tape.  The  program 
will  display  the  message:  "WANT  TO  LOAD  DATA  FROM  TAPE?".  The 
user's  response  is  either  "Y"  or  "N"  for  yes  or  no.  If  no,  the 
program  will  expect  the  data  to  be  loaded  through  the  keyboard, 
and  if  yes,  from  files  on  the  cassette  tape, 

3.  With  either  option  in  2 above,  the  program  will  next  request 
two  tape  file  numbers — where  the  data  is  to  be  found  or  where  to 
store  the  data.  The  first  number  gives  the  file  location  where 
the  variable  X is  (will  be)  stored  and  the  second  number 
gives  the  file  location  where  the  dependent  variable  Y is 
(will  be)  stored. 

4.  The  next  program  request  will  be  displayed  as:  "NUMBER  OF 

EQUATIONS?",  The  value  of  this  variable  (called  N)  depends  on 
the  number  of  internal  knots  the  user  proposes  to  use  in  his 
run  plus  the  order  of  the  Spline  (degree  + 1)  he  is  using.  For 
example,  if  the  user  is  running  the  Cubic  Spline  Program  with  1C 
internal  knots,  then  the  value  of  N would  be  14. 
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5,  The  next  program  request  is  for  input  of  the  knots.  All  knots 
are  input  in  order  from  1 through  N + K,  where  K is  the  Spline 
order.  The  program  requires  that  K knots  be  stacked  at  each  end 
of  the  interval  over  which  the  Spline  is  being  fit, 

B,  As  soon  as  the  knots  have  all  been  input,  the  program  immediately 
calls  subroutine  EQUATE  to  find  the  unique  Spline  function  that  mini- 
mizes  the  error  or  difference  between  the  input  data  and  the  Spline 
curve  calculated  for  the  knot  sequence  as  initialized.  The  program 
next  calls  a subroutine  to  calculate  the  error  and  this  error  value 

is  printed  when  returned  to  the  main  routine.  At  this  point,  the  pro- 
gram pauses  for  3 seconds  to  allow  the  user  to  halt  the  program  if 
he  wishes  to  branch  to  some  special  subroutine.  If  the  user  fails  to 
halt  the  program  at  this  point,  control  is  given  to  the  knot  moving 
routine  which  starts  the  optimization  process  of  moving  knots  to  re- 
duce  the  error, 

C,  The  program  will  remain  in  the  knot  moving  routine  until  the  user 
halts  the  program.  The  knot  moving  routine  will  pause  periodically 
to  allow  the  user  to  branch  to  another  subroutine  if  he  wishes.  This 
pause  will  occur  at  the  end  of  each  cycle  through  the  knot  moving  rou- 
tine, i*e,,  after  all  interior  knots  have  been  considered  as  candidates 
for  moving, 

D,  Program  Limitations,  The  program  is  currently  set  up  to  handle  a 
maximum  of  50  data  points  and  a max  value  of  30  for  N (humber  of 
equations  as  described  in  II, A. 4,),  This  restriction  is  a program 
limitation  where  array  lengths  are  currently  set  at  50  and  30  respec- 
tively, By  increasing  the  dimension  of  certain  arrays,  the  user  can 
increase  the  max  number  of  data  points  and  value  of  N — machine  limit 
for  dimension  size  is  256.  However,  the  program  run  time  on  this 
machine,  for  large  values  of  N and  large  data  sets  is  tremendous. 

The  program  assumes  the  values  stored  in  the  independent  variable 
array  X and  the  knot  sequence  T are  in  increasing  order. 


III*  Subroutine  Descriptions, 

A.  EQUATE , For  all  three  versions  of  the  Spline  program,  the  sub- 
routine EQUATE  can  be  found  starting  at  instruction  number  550  in  the 
program  listings,  The  structure  for  the  routine  was  taken  from  the 
technical  report  by  C.  deBoor,  [3], 

1.  In  order  to  decrease  the  program  run  time,  a number  of 
changes  were  made  to  the  subroutine  as  presented  by  deBoor  (page 
38  of  [3]),  The  major  changes  include  the  following: 

a.  The  number  of  DO  loops  was  reduced. 

b.  Cholesky's  decomposition  method  was  added  to  solve  the 
system  of  equations, 

c.  Linear  arrays  are  used  to  store  the  diagonal  elements  of 
the  band  matrix  instead  of  two  dimensional  arrays  as  done  by 
deBoor* 

d.  Since  the  matrix  being  generated  is  symmetric,  only  the 
main  and  upper  diagonals  are  generated  and  stored. 
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2.  For  all  chree  versions  of  the  program,  the  main  diagonal  and 
first  off-diagonal  vector  of  the  coefficient  matrix  are  stored  in 
arrays  C and  P respectively.  The  constant  vector  is  stored 
in  array  B for  the  system.  The  second  off-diagonal  vector  is 
stored  in  M for  the  quadratic  and  Cubic  versions  and  the  third 
off-diagonal  vector  is  stored  in  array  Q for  the  Cubic.  The 
system  of  equations  in  matrix  notation  has  the  following  form: 
(for  Quadratic  only) 


C1  P1  M1 
px  c2  p2  m2 

Mi  P,  C3  P3  M3 

• \\\v 

\ \ 


\ 


\ 


\ 


\ 


\ 


M 


m-2 


ta-1  ' 


^Mo'P.'c 
m-2  ml  m 


m 


m 


or  simply,  Ax  * B. 

Since  the  matrix  A'  is  real,  symmetric  and  positive  definite, 
then  it  is  possible  to  factor  A as  LL^  where  L is  a real 
lower-triangular  matrix.  Then  Ax  * B becomes  LL^-x  ■ B. 
Letting  y = L^x,  then  we  can  solve  for  y in  Ly  ■ b by  for- 
ward substltutipn  and  finally  we  can  solve  for  x in  L^x  » y 
by  back  substitution.  This  scheme  is  known  as  Cholesky's  Decom- 
position method  and  is  programmed  directly  into  the  EQUATE  sub- 
routine. 

B.  Plotting  Routines.  The  plotting  routines  are  seperate  subrou- 
tines and  are  not  called  by  the  main  program  or  any  other  routine. 
However,  they  do  require  that  the  spline  coefficients  be  generated 
before  being  called.  Also,  before  they  can  be  used,  the  plotter 
must  be  brought  on-line  and  properly  adjusted, 

1.  Spline  Graph.  This  routine  provides  the  user  with  a graph 
of  the  spline  function  based  on  the  spline  coefficients  that 
are  currently  in  memory.  The  user  will  be  asked  to  specify  the 
scaling  to  use  for  the  graph  and  step  size  (distance  between 
calculated  points— the  smaller  the  step  size  the  smoother  the 
graph) . 
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a.  The  first  program  request  reads;  "4  SCALE  VALUES?".  The 

user  inputs  4 values:  First  two  are  the  limits  on  the  x-axis 

of  the  graph  and  the  second  two  are  the  limits  on  the  y-axis 
of  the  graph.  For  example,  if  we  know  that  our  input  data  for 
the  x-axis  spans  values  from  0 to  100  and  we  expect  our  spline 
function  to  not  vary  outside  the  limits  of  -10  to  + 10,  we 
might  input  the  following  values:  0,  100,  -10,  10. 

b.  The  second  program  request  reads:  "2  VALUES  X-CROSS  AND 

XTIC?".  The  user  inputs  the  point  he  wants  the  x-axis  to 
cross  the  y-axis  and  the  spacing  for  the  Tic  marks  on  the 
x-axis.  An  input,  for  the  example  given  above,  might  be: 

0,  10. 

c.  The  next  program  request  reads:  "2  VALUES  FOR  Y-CROSS 

AND  YTIC?".  The  user  inputs  the  point  he  wants  the  y-axis 
to  cross  the  x-axis  and  the  spacing  of  the  Tic  marks  on  the 
y-axis.  For  the  example  given  above,  an  input  might  be: 

0,  1.  With  the  example  Inputs  given  in  a,  b,  and  c above, 
the  program  would  sketch  the  x and  y axis  as  follows: 


4*— * — t— * — t — +— » — ► 


d.  After  having  the  axis  plotted,  the  program  will  next  ask 

for  the  step  size  (Quadratic  and  Cubic  version  only) . The 
display  simply  reads:  "STEP  = ?".  If  the  user  requires  a 

smooth  curve,  the  step  size  should  be  small  compared  to  the 
overall  length  of  the  curve.  Usually  100  to  200  points  pro- 
vides a very  smooth  graph.  For  the  example  given  above,  a 
step  size  of  one  would  provide  100  graph  point.3  of  the  spline 
function  connected  by  straight  lines  which  should  provide 

the  appearance  of  a fairly  smooth  curve.  For  the  linear  spline 
program,  a step  size  is  not  required  because  the  program  sim- 
ply joins  the  knots  with  straight  lines, 

e.  After  the  spline  graph  is  completed,  the  program  will  next 
automatically  sketch  in  the  locations  of  all  interior  knots, 

2,  Data  Point  Plot.  To  plot  the  data  points  is  an  option  that 
is  usually  exercised  after  the  Spline  Graph  has  been  completed. 

This  plot  provides  an  "overlay"  of  where  the  input  data  lie  with 
respect  to  the  Spline  graph.  The  routine  simply  steps  through 
each  data  point  and  plots  a small  "x"  at  the  coordinates  of  the 
point. 

3.  Error  Plot.  This  routine  provides  a graphical  representation 
of  the  error  at  each  data  point.  The  error  is  the  difference  be- 
tween the  input  data  point  value  and  Spline  Function  value  for 
the  same  point.  The  routine  does  its  own  scaling  based  on  input 
data  and  on  the  square  root  of  the  sum  of  squares  of  the  error 
taken  over  all  data  points.  Tic  marks  are  provided  for  each  axis 
and  the  value  of  each  is  printed  on  the  printer. 
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C.  Knot  Moving  Routine.  Once  the  Main  routine  transfers  control 
to  the  knot  moving  routine,  this  routine  will  stay  in  an  endless 
loop  constantly  trying  to  reduce  the  error.  The  only  way  the  pro-* 
gram  will  stop  is  by  operator  intervention.  The  program  allows 
such  intervention  at  the  end  of  each  cycle  through  the  knot  sequence, 
i.e.,  after  all  knots  have  been  considered  as  candidates  for  moving. 
At  the  end  of  each  cycle,  the  program  pauses  for  three  seconds  dis- 
playing the  message  "PLOT”.  If  the  user  fails  to  stop  the  program 
at  this  point,  the  routine  branches  to  the  beginning  and  starts  the 
cycle  all  over  again  in  moving  knots, 

1«  Logic  of  Knot  Moving  Routine.  This  routine  considers  each 
knot,  in  order,  as  a possible  candidate  for  moving*  The  routine 
will  not  change  the  position  of  a knot  unless  the  new  position 
decreases  the  overall  sum  of  squares  of  the  errors, 

a.  The  routine  starts  at  tie  current  position  of  the 
knot  and  steps  in  the  direction  of  decreasing  error.  The 
step  size  El  is  determined  by  the  variable  H8  which  is 
set  to  128  by  default  (user  can  adjust  this  value  if  he 
wishes— set  in  instruction  120  of  all  versions  of  the  pro- 
gram), El  is  initially  set  to  1/H8  of  the  distance  between 
candidate  knot  X5  and  adjacent  knot.  Call  the  interval  Z. 

b.  The  candidate  knot's  new  position,  X4,  is  determined  by 
X4  = X5  + El.  El  can  be  positive  or  negative  depending  on 
which  interval — left  or  right  of  current  knot — is  considered. 

c.  If  position  X4  yields  an  improved  error  value  over  X5 , 
then  El  is  doubled  and  a third  candidate  position  X3  is 
examined  (X3  = X4  + El) . 

d.  If  position  X3  gives  a smaller  error  value,  we  then  con- 
tinue stepping  through  the  interval  Z toward  the  adjacent 
knot  until  we  either  step  outside  Z or  until  a candidate 
position  X3  produces  an  error  greater  than  the  error  at 
position  X4 . If  this  happens,  graphically,  we  have  the 
following  condition:j 
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We  know  that  we  are  in  the  vicinity  of  a minimum.  We  can 
approximate  the  position  of  that  minimum  by  simply  fitting 
a quadratic  through  the  points  Qj5,A5),  (X4,A4),  (X3,A3) 
and  solving  for  the  minimum. 

e.  After  having  found  a new  location  for  a knot,  we  call 
EQUATE,  calculate  the  new  sum  of  squares  for  the  errors, 
and  loop  on  next  knot. 

Dr  Derivative  Routine.  This  routine  calculates  and  prints  the 
zeroth  through  the  second  derivative  of  the  Spline  Function  for  60 
equally  spaced  points  along  the  x-axis  for  the  entire  span  of  data. 
The  value  60  is  set  by  default  in  the  program  and  the  user  may 
change  the  value  if  he  wishes  (instruction  9050  of  all  versions  of 
the  program).  This  routine  is  based  on  the  formulae  in  [1]. 
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IV,  Examples  of  Output  from  the  Program.  Output  from  two  separate  runs 
are  presented. 

A,  The  first  run  uses  the  Quadratic  version  of  the  Spline  program. 
Twenty  data  points  are  used  and  two  internal  knots-  The  spline  was 
graphed  before  any  knots  were  moved  and  again  after  three  cycles 
through  the  knot  moving  routine*  The  error  plot  routine  was  also 

run  after  the  knots  were  moved.  Figure  1 shows  output  to  the  printer* 
1*  When  the  Quadratic  version  of  the  program  was  executed , the 
first  two  messages  printed  were: 

"QUADRATIC  SPLINE  PROGRAM" 

"K  EQUALS  3"  . 

2*  The  program  next  requested  the  number  of  data  points.  The 
number  20  was  input  and  the  program  printed: 

"L9  EQUALS  20". 

(L9  is  a program  variable  used  for  storing  data  point  count) . 

3.  The  data  was  loaded  via  cassette  tape  and  printed. 

4.  The  next  request  was  for  the  number  of  equations;  the  number 
5 was  input  and  "N  “ 3"  was  printed. 

5.  The  program  next  requested  the  input  of  knots  and  each  was 
printed  by  the  program  when  input. 

6.  After  all  knots  were  input,  the  program  calculated  the  ini- 
tial sum  of  squares  of  the  error  and  printed  the  message:  "SUM 

OF  SQUARES  = 10,24179658".  At  this  point,  the  program  was 
halted  and  control  transferred  to  the  Spline  Graph  Routine.  The 
first  graph  (labeled  Plot  1 in  fig.  2. a)  was  accomplished.  The 
Data  Point  Plot  was  also  accomplished  at  this  point. 

7.  Control  was  then  given  to  the  knot  moving  routine.  As  each 
knot  was  moved,  the  new  location  and  the  new  sum  of  squares  of 
the  error  were  printed.  As  can  be  seen  in  figure  1,  the  two 
internal  knots  were  moved  from  2.1  and  7.0,  as  input,  to 
3.213337131  and  6.893706417  with  a 10  fold  reduction  in  the  error. 
Three  cycles  through  the  knot  moving  routine  were  required  to 
accomplish  this  reduction, 

8.  The  program  was  halted  at  this  point  and  control  transferred 
to  the  Spline  Graph  routine  again.  The  new  Spline  Graph  (Plot 

2 of  fig,  2, a)  was  accomplished  and  the  results  show  a much  bet- 
ter fit  to  the  input  data, 

9.  After  the  Spline  Graph  was  completed,  an  error  plot  was  done 
(fig.  2.b).  The  scaling  was  done  automatically  and  the  message: 

"XTIC  - 0.855  YTIC  ■ 0,458175936"  provides  the  user 

with  the  value  of  the  X and  Y tic  marks. 

B,  The  second  example  used  the  same  data  set  but  was  run  using  the 
Linear  Spline  program  and  six  internal  knots. 

1.  The  output  from  the  printer  is  shown  in  figure  3 and  the 
plotted  output  in  figure  4, 

2.  The  program  was  again  allowed  to  cycle  three  times  through 
the  knot  moving  sequence.  The  plots  were  made  after  the  third 
cycle. 

C,  Figure  5 provides  an  example  of  output  for  the  derivative  rou- 
tine, The  routine  calculates  and  prints  the  zeroth,  first,  and 
second  derivative  for  twenty  equally ‘spaced  points  along  the  Quad- 
ratic Spline  curve  generated  for  example  1 above. 
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Figure  1 : 


rRI.NT.2R  OUTPUT  FOR  EXAMPLE  1 


QUADRATIC  SPLINE  PROGRAM 

K EQUALS  5 
L9  EQUALS  20 

1 

0.45 

0.532832779 

o 

0.9 

1.135824019 

'5 

1.35 

1.634336624 

4 

1.8 

1.850310499 

5 

2.25 

1.653405543 

6 

2.7 

1.004342719 

7 

3.15 

-0.021648662 

8 

3.6 

-1.239057241 

9 

4.05 

-2.385288895 

10 

4.5 

-3.176972882 

11 

4.95 

-3.377363166 

12 

5.4 

-2.859228  04 

13 

5.85 

-1.64757377 

14 

6.3 

0.069777687 

15 

6.75 

1 .968942823 

16 

7.2 

3.650872173 

17 

7.65 

4.724967051 

18 

8.1 

4.897943545 

19 

8.55 

4.04801103 

20 

9 

2.266651669 

N=  5 

T(I  )'=  0 

1 

T ( I ) = 0 

2 

T ( 0 - 0 

3 

T(l )=  2.1 

4 

T(I )=  7 

5 

T (I )=  9.3 

6 

T(I)=  9.3 

7 

T(l)=  9.3 

8 

SUM  OF  SQUARES 

= 10.24179658 

2.705905100 

3.776486574 

7.023652237 

3.765599501 

3.026696625 

1 .836770835 

6.964153151 

1.763237007 

3.213337131 

1.055100397 

6.893706417 

0.951943319 

XTIC  - 0*855  YSIC  = 0. toy 09367 4 
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XTIC=  0.855 
YTIC=  0.109083874 


Figure  3:  PRINTER  OUTPUT  FOR  EXAMPLE  2 


LINEAR  SPLINE  PROGRAM 
K EQUALS  2 
L9  EQUALS  20 


N=  8 

T(I)  = 0 
T(I)=  0 
T(l)=  2.5 
T(I)=  3.3 
T(I}«  4.5 
T(I)=  5.5 
T(I)-  7.5 
T(I)*  8.5 
T(l )=  9.3 
T(I)=  9.3 


1 

2 

3 

4 

5 

6 

7 

8 

9 

10 


SUM  OF  SQUARES 

2.312506664 

3.3 

4.530897345 

5.514309408 

7.508981167 

8.468580843 

2.312506664 

3.3 

4.562157505 

5.524269999 

7.508981167 

8.439467487 

2.312506664 

3.3 

4.590545267 

5.532607619 

7.508981167 

8.41376275 


* 0.508496390 

0.507836829 

0.507836829 

0,494072159 

0.487454070 

0.484908316 

0.450663854 

0.450663854 

0.450663854 

0.437636837 

0.434330461 

0.434330461 

0.408240592 

0.408240592 

0.408240592 

0.398339792 

0.395976608 

0.395976608 

0.3778665422 
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Figure  5:  EXAMPLE  OP  OUTPUT  FROM  DERIVATIVE  ROUTINE  FOR 

20  EVENLY  SPACED  POINTS 


SPLINE 


X- VALUE 

VALUE 

0 

-1.276456667 

0.465 

0.272212206 

0.93 

1.336361559 

1.395 

1.915991394 

1 .86 

2.011101709 

2.325 

1.621692505 

2.79 

0.747763783 

3.255 

-0.610305260 

3.72 

-1.934006303 

4.185 

- 2.758326289 

4. 65 

-3.08326522 

5.115 

-2.908823095 

5.58 

-2.234999914 

6.045 

-1.061795678 

6.51 

0.610789615 

6.975 

2.759180793 

7.44 

4.389483645 

7.905 

<.  4.976479164 

8.37 

4.520167350 

8.835 

3.020548203 

9.3 

0.477621723 

FIRST  SECOND 

DERIVATIVE  DERIVATIVE 

3.85H59424  -2.240811742 

2.809481963  -2.240811742 

1.767504503  -2.240811742 

0.725527043  -2.240811742 

-0.316450417  -2.240811742 

-1.358427878  -2.240811742 

-2.400405338  -2.240811742 

-3.383637786  2.309543558 

-2.309700031  2.309543558 

-1.235762277  2.309543558 

-0.161824522  2.309543558 

0.912113232  2.309543558 

1.986050987  2.309543558 

3.05998874  2.309543558 

4.133926495  2.309543558 

4.62786348  -4.825100395 

2.384191797  -4.825100395 

0.140520113  -4.825100395 

-2.103151571  -4.825100395 

-4.346823255  -4.825100395 

-6.590494939  -4.825100395 
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REM 
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96  M “X  "Hr 


REM 

REM 

REM 

REM 

DIM 

DIM 

K=2 

REM 


***** 

********************************* 

SPLINE  GRAPH  7000 

ERROR  GRAPH 7380 

DATA  POINT  PLOT 7290 

RETURN  TO  KNOT  MOVING  ROUTINE  7640 


Til  NEAR  SPLINE  PROGRAM 
SUBROUTINE  JUMP . ADDRESSES 
1 


DERIVATIVE  ROUTINE 


9000 


30],e[  30] ,X[ 50] ,G[ 50]  ,H$r  1 ] 

6, 6 j ,T[ 30] ,P[ 30 ] ,M[ 30] ,D[ 6, 6] ,A[ 30] , C[30] , S[ 6] 


****  H8  USED  IN  KNOT  MOVING  ROUTINE  *** 


120  H8=1 28 

130  PRINT  "LINEAR  S LINE  PROGRAM" 

140  PRINT  "K  EQUALS »K 

150  DISP  "NUMBER  OP  DATA  PTS"; 

160  INPUT  L9 
170  PRINT  "L9  EQUALS "L9 
180  DISP  "WANT  TO  LOAD  DATA  PROM  TAPE"; 
190  INPUT  H3 
200  IP  H$="Y"  THEN  300 
210  DISP  "2  PILE  #S  WHERE  DATA  GOES"; 
220  INPUT  A8,A9 
230  FOR  1=1  TO  L9 
40  DISP  "INPUT  X,Y"I; 

250  INPUT  X[I  l,G[l] 

260  NEXT  I 

270  STORE  DATA  A8,X 
280  STORE  DATA  A9,G 
29 0 GOTO  370 

300  DISP  "2  PILE  #S  WHERE  DATA  IS"; 

310  INPUT  A8,A9 
320  LOAD  DATA  A8,X 
330  LOAD  DATA  A9,G 
340  DISP  "PRINT  DATA"; 

350  INPUT  m 
360  IF  HS#"Y"  THEN  400 
370  FOR  1=1  TO  L9 
380  PRINT  I , X[ I ] , G[I ] 

390  NEXT  I 

400  DISi  "NUMBER  OP  EQUATIONS=?" ; 

410  INPUT  N 

420  N1--N-1 

430  PRINT  "N="N 

440  FOR  1=1  TO  N+K 

450  DISP  "KNOT  IN  T(I)=?"I; 

460  INPUT  T[ I ] 

470  PRINT  "T(I)="T[I],I 
480  NEXT  I 
90  GOSUB  550 
500  GOSUB  8070 

510  PRINT  "SUM  OF  SQUARES  ="S1 
520  DISP  "PLOT?"; 

530  AIT  3000 
5’>0  GOTO  7640 
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550  ’’EM  * ***********suBR0UTINE  EQUATE********************************** 
570  FOR  11=1  TO  N 
580  B[I1]=C[I1 ]*P[I1 ]-0 
610  NEXT  II 
620  I-K 
650  11=0 

640  FOR  L=1  TO  L9 
650  IF  I=N  THEN  700 
660  IF  X[L]<T[I+1]  THEN  700 
670  1=1+1 
680  II =I-K 
690  GOTO  650 
700  T=X[L] 

710  P1=T[l+1 1-T 
720  Ml =T“T[l ] 

750  Z9=1 / (P1+M1 ) 


740 

3 

1=P1 

*Z9 

750 

S2=M1 

*29 

760 

G1=G[L] 

770 

12=11 

+ 1 

780 

B 

'12 

=S1*G1+B 

'12' 

790 

C 

*12' 

=S1*S1+C 

*12" 

800 

P 

'I2*1 

=S1*S2+P 

;i2] 

810 

12=12+1 

820 

B 

'12. 4 

=S2*G1+B 

[121 

850 

C 

’l2. 

=S2*S2+C 

[12] 

840 

NEXT 

L 

850  REM  ***************  BANDED  MATRIX  ROUTINE  *************************-! 
860  REM  SOLVES  SYSTEM  USING  CHOLESKY'S  DECOMPOSITION  METHOD 
870  FOR  J=1  TO  N1 
880  J1=J+1 

J]=D=SQR(C[J]) 

[j1=di=p[j]/d 

J 1 ] — C[J1 ]-D1*D1 


890  E 
900  P 
910  C 


920  n: 


950 

940 


w 


CXT 


J 

I N]=SQR 


/ 

TO  N 


1 


950  FOR  J=2 
960  J1=J-1 

970  C[j]=(B[j]-P[J1 ]*C[J1 ])/e[ J] 

980  NEXT  J 

990  A[N]=C[nJ/E[N] 

1000  FOR  J=N1  TO  1 STEP  -1 
1010  J1=J+1 

1020  A[J]=(C[ J]-P[J]*A[J1 ])/E[j] 
1050  NEXT  J 
1040  RETURN 
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7000  HEM*  x **  * x-x-x  *3PLINE  GRAPH*  ***  **********  **********  *********  *******  ****** 
7010  .DIOP  "4  SCALE  VALUES"; 

7020  INPUT  Q9,Q8,Q7,Q6 
7030  SCALE  Q9,Q3,Q7,Q6 
7040  DIOP  "2  VALUES  X-CROSS  AND  XTIC"; 

7050  INPUT  Q7,Q6 
7060  XAXIS  07, Q6 

7070  DISP  "2  VALUES  FOR  Y-CROSS  AND  YTIC"; 

7080  INPUT  Q7,Q6 
7090  YAXIS  Q7,Q6 
7110  I=K 

7120  FOR  J =2  TO  N+1 
7130  T=T[ j] 

7140  GOSUB  8930 
7150  GOSUB  7550 
7160  PLOT  T, Cl 
7170  NEXT  J 
7180  I=K 

7190  FOR  Q6=K+1  TO  N 
7200  REN 
7210  T=T[Q6] 

7720  GOSUB  8930 
7230  GOSUB  7550 
7240  PLOT  T, Cl 
7250  CPLOT  -0.3, -0.3 
7760  LABEL  (*)»0" 

7270  NEXT  Q6 
7280  STOP 

7290  REM  ******  DATA  POINT  PLOT  **************************************** 

7300  FOR  11=1  TO  L9 
7310  PLOT  X[Il]fG[Il] 

7320  CrLOT  -0.3, -0.3 
7330  LABEL  (*)"X" 

7350  PEN 
7360  NEXT  11 
7370  STOP 

7380  REM  *****  ERROR  PLOT  ROUTINE  ************************************ 

7390  Y=3*SQR(S1/L9) 

7400  SCALE  X[l],X[L9],-Y,Y 
7410  XAXIS  0, (X[L9]-X[1])/10 
7420  YAXIS  X[ 1 ] , Y/6 

7P30  PRINT  "XTIC  ="(X[L9]-X[ 1 ])/10"YTIC="Y/6 

7440  I=X 

7450  FOR  J=1  TO  L9 
7460  T=X[ J ] 

7470  GOSUB  8930 
7480  GOSUB  7550 
7490  PLOT  T, G[ J ] — C 1 
7495  PLOT  T, 0 
7500  PEN 
7510  NEXT  J 
7520  STOP 
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7530  REM  *******  CALCULATE  SPLINE  VALUE  FROM  SPLINE  COEFFICIENTS  ******* 
7550  Pi =T[ 1+1 1-T 
7570  M2=T-T[Ij 

7600  Cl  = (M2* A[I ]+P1*A[l-1 ] )/(M2+P1 ) 

7650  RETURN 

7640  REM* * * * * **  * ****  ***KNOT  MOVING  SUBROUTINE*************************** 

7650  FOR  1 5 = 3 TO  N 

7660  :$1«(T[I5+1  ]-T[l5])/H8 

7670  A5=S1 

7630  X5-T[I5] 

7690  T[I5J=a4=T[I5]+E1 
7700  GOSUB  8070 
7710  IF  S1<A5  THEN  7770 
7720  E1=(T[I5-1 ]-X5)/H8 
7730  T[I5]=X4=X5+E1 
7740  GOSUB  8070 
7750  IF  SKA 5 THEN  7770 
7760  GOTO  7990 
770  a4=S1 
7780  E1=E1+B1 
7790  T[l5 ]=X3=X4+E1 

7800  IF  (X3<T[I5-1 ] OR  X3>T[I5+1 ] ) THEN  7990 
78l 0 GOSUB  8070 
7820  A3=S1 

7830  IF  (A3>A4)  THEN  7890 

7840  A5=A4 

7850  X5=X4 

7860  A4=A3 

7870  X4=X3 

7880  GOTO  7780 

7890  X6=X3*X3 

7900  X7=X4*X4 

7910  X8=X5*X5 

7920  R1 =A5* (X6-X7 )+A4* (X8-X6)+A3* (X7-X8) 

7930  R2=(A5* (X3-X4)+A4*(X5-X3)+A3* (X4-X5) )*2 

.7940  T[I5]=R1/R2 

7950  GOSUB  8070 

7960  IF  (S1<A4)  THEN  8000 

7970  T [ 1 5 ] =X4 

7980  GOT  8000 

7990  T[I5]=T[I5]-E1 

8000  GOSUB  550 

8010  GOSUB  8070 

8020  PRINT  T [ 1 5 ] * 3 1 

8030  NEXT  15 

8040  DI5P  "PLOT?"; 

8050  WAIT  3000 
8060  GOTO  7650 
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8070  REM***********C0MPUTES  SUM  OF  SQUARES************************ 

8080  SI =0 
8090  I=K 

8100  FOR  L=1  TO  19 
8110  T-X[L] 

8120  IF  T[l+l]  >=  T THEN  8170 
8130  1=1+1 

8140  IF  I <=  N+1  THEN  8120 
8150  PRINT  "RANGE” 

8160  STOP 
8170  P1=T[l+1 1-T 
8190  M2=T-T[IJ 

8220  C1=(M2*A[l]+P1*A[l-l])/(M2+Pl) 

8250  S=01-G[LJ 
8260  S1=S1+S*S 
8270  NEXT  L 
8280  RETURN 

8930  REM***  FIN  S I SUCH  THAT  T(l)<=T<T(l+1 ) USING  FORWARD  SEARCH  ONLY** 
8940  IF  T[l+1 ] >=  T THEN  8990 
8950  1=1+1 

8960  IF  I <=  N+1  THEN  8940 
8970  PRINT  "RANGE"; 

8980  STOP 
8990  RETURN 
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9000  KI5M  ************  DERIVATIVES********************* 

§010  REM  THIS  ROUTINE  CALCULATES  THE  ZEROTH  THROUGH  D1  DERIVATIVES  OF 
9020  REM  AND  PLACES  THEM  IN  P(l)  THROUGH  p(D1  + l)  RESP.  SPUt^ 

9030  REM  THE  DERIVATIVES  ARE  AT  T 
9040  D1 =3 

9050  H1=(T[N+l]-T[K])/60 

9060  FOR  T=T[Kj  TO  T[N+1 ] STEP  HI 

9070  I=K 

9080  GOSUB  8930 

9090  REM**********************  CALCULATES  N(l,K)  AT  T***************** 

9100  N[ 1 , 1 ]=1 

9110  FOR  S= 1 TO  K-1 

9120  P[ S]=T[l+S]-T 

9130  M’SJ=T-T[I+1-S] 

9140  N[  ,S+1 ]=0 

9150  FOR  R=1  TO  S 

9160  Z9=N[R,S]/(P[R]+M[S+1~R]) 

9170  n[r, 3+1 ]=N[R, S+1 ]+P[R]*Z9 

9180  N[R+1 ,S+1]=M[S+1-R]*Z9 

9190  NEXT  R 

9200  NEXT  S 

9210  FOR  S=1  TO  K 

9220  D[S, 1 ]=A[l-S+1 ] 

9230  NEXT  S 
9240  FOR  S=2  TO  K 
9250  FOR  R=S  TO  If 

9260  D[R,S]=(D[R-1,S-l]-D[R,S-1])/(T[l+K-R+l]-T[I-R+S])  ' 

9270  NEXT  R 
9280  NEXT  S 
9290  Ml =1 

9300  FOR  1=1  TO  D1+1 
9310  31=0 

9320  FOR  J=1  TO  K-I+1 

9330  S1=S1+D[K+1-J,I]*N[J,K+1-I] 

9340  NEXT  J 
9350  P[I ]=S1*M1 
9360  M1=M1*(K-I ) 

9370  NEXT  I 

9380  PRINT  T,P[1],P[2],P[3] 

9390  NEXT  T 
9400  STOP 

9500  REM  ***  SPLINE  FUNCTION  FNS(T)  *********************************** 
9510  DEF  FNS(T) 

9520  I=K 
9530  GOSUB  8930 
9540  GOSUB  7550 
9550  RETURN  Cl 
9560  END 
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I##* 


1C  i 
20  REM  ** 
30  ROM 
4 0 REM 
50  REM 
60  R M 
70  REM 
80  DIM  B 
90  DIM  N 


**  QUADRATIC  SPLINE  PROGRAM  **** 

***  SUBROUTINE  JUMP  ADDRESSES 

(1)  SPLINE  GRAPH  

(2)  ERROR  GRAPH  

(5)  DATA  POINT  PLOT  

(4)  RETURN  TO  KNOT  MOVING  ROUTINE 

(5  DERIVATIVE  ROUTINE  — 

30].E[30].X[50],G[50],H3[1] 
6,6],T[30j,P[30],M[50],D[6,6],A[30],C[30]fS[6] 


******************-********-***** 


7000 

7380 

7290 

7640 

9000 


100  K = 3 

110  REM***  H8  USED  IN  KNOT  MOVING  ROUTINE  *** 

120  HQ=1 28 


130  PRINT  "QUADRATIC  SPLINE  PROGRAM" 

140  PRINT  "K  EQUALS  K 

150  DISP  "NUMBER  OP  DATA  PTS"; 

160  INPUT  L9 

17  PRINT  "L9  EQUALS "L9 

180  DISP  "WANT  TO  LOAD  DATA  PROM  TAPE"; 

190  INPUT  H$ 

200  IF  HQ="Y"  THEN  300 

210  DISP  "2  PILE  //S  WHERE  DATA  GOES"; 

220  INPUT  A8,A9 

230  FOR  1=1  TO  L9 

40  DSP  " NPUT  X, Y"I ; 

250  INPUT  X[l],G[l] 

260  NEXT  I 

270  STORE  DATA  A8,X 
280  STORE  DATA  A9,G 
290  GOTO  370 

300  DISP  "2  PILE  #S  ’WHERE  DATA  IS"; 

310  INPUT  A8,A9 
3 0 LOAD  DATA  A8,X 
330  LOAD  DATA  A9,G 
340  DISP  "PRINT  DATA"; 

350  INPUT  HI 
360  IP  HQ#»Y"  THEN  400 
370  FOR  1=1  TO  L9 
380  PRINT  I,X[I],G[I] 

390  NEXT  I 

400  DISP  "NUMBER  OP  EQUATI0NS=?" ; 

410  INPUT  N 


420  N1 =N-1 


425  N2=N-2 

430  PRINT  "N="N 

440  FOR  1=1  TO  N+K 

450  DISP  "KNOT  IN  T(l)=?"I; 

460  IN  "UT  T[  I ] 

470  PRINT  "T (I ) = "T[l  ] , I 
480  NEXT  I 
490  G03UB  550 
500  GOSUB  8070 

510  PRINT  "SUM  OP  SQUARES  ="S1 
520  DISP  "PLOT?"; 

530  WAIT  3000 
540  GOTO  7640 
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350  REM  *************SUBR0UTINE  EQUATE**********************************' 

560  FOR  11-1  TO  N 

570  B[I1 ]=0[I1 ]=P[I1 ]=M[I1 ]=0 

610  NEXT  II 

620  I=K 

650  11=0 

640  FOR  L= 1 TO  L9 
650  IF  I=N  THEN  700 
660  IF  X[L]<T[I+1]  THEN  700 
670  1=1+1 
600  I 1=I-K 
690  GOTO  650 
700  T=X[L] 

710  P1=T[l+1 1-T 
720  P2=T[I+2 1-T 
730  Ml =T-T[l 1 
740  M2=T-T[I-1 ] 

750  Z9=1 /(P1+M1 ) 

760  S1=P1*Z9 
770  C2=M1*Z9 
780  32=C2 
790  Z9=S1 / (P1+M2) 

800  S1=P1*Z9 
8 0 C2=M2*Z9 
820  Z9=S2/ (P2+M1 ) 

830  02=C2+P2*Z9 
840  33=M1*Z9 
850  G1=G[L] 

860  12=11+1 

870  B[I2]=S1*G1+B[I2 

880  C 12'  S1*S1+C'I2' 

890  P[I2  =S1*S2+P'I2 
900  M[I2]=S1*S3+M[I2 
910  12=12+1 
920  B[I2]=S2*G1+B[I21 
930  C 12 ]=S2*S2+C[I2 J 
940  i![I2=S2*S3+P[I2] 

950  12=12+1 
960  B[I2 J =33*G1+B[l2 1 
970  C[I2J=S3*S3+C[I2J 
980  NEXT  L 
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990  REM  ***************  banded  matrix  ROUT  NE  *************************** 

1000  ROM  SOLVES  SYSTEM  USING  CHOLESKY'S  DECOMPOSITION  METHOD 

1010  FOR  J=1  TO  N2 

1020  J1=J+1 

1030  J2=J+ 

1040  B[ J ]=D=SQR(C[ J ] ) 

1050  P J" =D1 =P[J 1/D 
1060  M* JJ=D2=M[J J/D 
1070  C[ J1 1=g[ji]-di*di 
1080  P'JI '=P’J1 '-D1*D2 
1090  C[J2J=cfj2j -D2*D2 
1100  NEXT  J 

1110  .'S[N1  1=d=sqr(c[ni  ]) 

1120  p'ni ]=di=p[ni j/D 
1130  E'N]=SQR(C[Nl-D1*D1) 

1140  C[l1=B[l]/E[l] 

1150  C[2]=(B[2]-C[1]*P[1])/E[2] 

1160  FOR  J=3  TO  N 
1170  J1=J-1 
1180  J2=J-2 

1190  C[J1=*(B[J]-M[J2]*C[J2]-P[J1]*C[J1])/E[J] 

1700  NEXT  J 

1210  A[N]=C[N]/E[N] 

1220  A[N1 ]=(C[N1]-P[N1 ]*A[N])/E[N1 ] 

1230  POR  J=N2  TO  1 STEP  -1 
1240  J1=J+1 

1250  J2=J+2  * 

1260  A[j]=(C[j]-P[j]*A[Jl]-M[j]*A[J2])/E[j] 

1270  NEXT  J 
1280  RETURN 
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7000  ********  SPLINE  GRAPH******************************************* 

7010  DIOP  "4  SCALD  VALUES”; 

7020  INPUT  Q9,Q8,Q7,Q6 
7050  SCALE  Q9,Q8,Q7,Q6 
7040  DISr  ”2  VALUES  X-CROSS  AND  XTIC"; 

7050  INPUT  Q7,Q6 
7060  7- AX. IS  Q7,Q6 

7070  DISP  ”2  VALUES  FOR  Y-CROSS  AND  YTIC”; 

7080  INPUT  Q7,Q6 
7090  YAXIS  Q7,Q6 
7100  DISP  "STEP  =”; 

7110  INPUT  Q5 
7120  I=K 

7130  FOR  T=Q9  TO  Q8  STEP  Q5 

7140  GOSUB  8930 

7150  GOSUB  7550 

7160  PLOT  T,  Cl 

7170  NEXT  T 

7180  I=K 

7190  FOR  Q6=K+1  TO  N 

7200  PEN 
7210  T=T[Q6] 

7220  GOSUB  8930 
723 0 GOSUB  7550 
7240  PLOT  T,  Cl 
7250  CPLOT  -0.3, -0.3 
7260  LABEL  (*)"0M 
7270  NEXT  Q6 
7280  STOP 

7290  REM  ******  DATA  POINT  PLOT  **************************************** 
7300  FOR  11=1  TO  L9 
7310  PLOT  X[I1],G[I1] 

7320  CPLOT  -0.3, -0.3 
7330  LABEL  (*)»X" 

7350  PEN 
7360  NEXT  II 
7370  STOP 

7380  REM  *****  ERROR  PLOT  ROUTINE  ************************************ 

7390  Y=3*5QR(S1/L9) 

7A00  SCALE  X[1],X[L9],-Y,Y 
7410  YAXIS  0, (X[L9]-X[1])/10 
7420  YAXIS  X[1],Y/6 

7450  PRINT  "XTIC  =" (X[L9]-X[ 1 1 ) /10”YTIC=”Y/6 

7440  l=K 

7450  FOR  J=1  TO  L9 
7460  T=X[ J ] 

7470  GOSUB  8930 
7430  GOSUB  7550 
7490  PLOT  T,G[J]-C1 
7495  PLOT  T,0 
7500  PEN 
7510  NEXT  J 
7520  STOP 
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7530  REM  *******  CALCULATE  SPLINE  VALUE  PROM  SPLINE  COEFFICIENTS  ******* 

7550  ?1=Tri+ll-T 

7560  P2=T[l+2J-T 
7570  M2=T-T[I-1 ] 

7580  M3=T-T[ I 1 
7590  C2=A[I-1J 

7600  Cl =(M2*C2+P1*A[l-2] ) / (M2+P1 ) 

7610  C?=(M3*A[ll+P2*C2)/(M3+P2) 

7620  Cl  = (M3*C2+P1*C1 ) / (M3+P1 ) 

7630  PR TURN 

7640  rj.]M***************KNOT  MOVING  SUBROUTINE***************************** 

7650  FOR  15=4  TO  N 
7660  E1=(T[I5+1]-T[I5])/H8 
7670  A5=S1 
7680  X5=T[I5] 

7690  T[I5J=X4=T[I5]+E1 

7700  GOSUB  8070 

7710  IF  SKA 5 THEN  7770 

7720  E1=(T[I5-1]-X5)/H8 

7730  T[I5]=X4=X5+E1 

7740  GOSUB  8070 

7750  IP  SKA 5 THEN  7770 

7760  GOTO  7990 

7770  A4=S1 

7780  RKE1+E1 

7790  T[I5]=X3=X4+E1 

7800  IF  (X3<T[I5-1 ] OR  X3>T[l5+l])  THEN  7990 
7810  GOSUB  8070 
7820  A3=S1 

7830  IF  (A3>A4)  THEN  7890 

7840  A5=A4 

7850  X5-  :4 

7860  A4=A3 

7870  X4=X3 

7880  GOTO  7780 

7890  X6=X3*X3 

7900  X7=X4*X4 

7910  X8=X5*X5 

7920  R1 = 5*(X6-X7)+A4*(X8-X6)+A3*(X7-X8) 

7930  R2=(A5*(X3-X4)+A4*(X5-X3)+A3*(X4-X5))*2 

7940  T[I51=R1/R2 

7950  GOSUB  8070 

7960  IF  (S1<A4)  THEN  8000 

7970  T[I5]=X4 

7980  GOTO  8000 

7990  T[l5l=T[.I5]-E1 

3000  GOSUB  550 

8010  GOSUB  8070 

8020  PRINT  T[I5],S1 

8030  NEXT  15 

8040  LISP  "PLOT?"; 

8050  WAIT  3000 
8060  GOTO  7650 
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8070  REM***********C0MPUTES  SUM  OP  SQUARES************************ 

8080  S1=0 
8090  I=K 

8100  POR  L- 1 TO  L9 
110  T=X[L] 

8120  IF  T[I+1]  >=  T THEN  8170 
8130  1=1+1 

8140  IF  I <=  N+1  THEN  8120 
8150  PRINT  "OUT  OP  RANGE"; 

8160  STOP 
8170  P1=T[l+1 1-T 
8180  P2=T[ I+2J-T 
8190  M2=T-T[I-1 ] 

8200  M3=T-T[ I ' 

8210  C2=A[l-l‘ 

8220  C1=(M2*C2+P1*A[l-2])/(M2+P1) 

8230  C2=(M3*A[I]+P2*C2)/(M3+P2) 

8240  C1=(M3*C2+P1*C1)/(M3+P1 ) 

8250  3=C1-G[L] 

8260  31=S1+S*S 
8270  NEXT  L 
8280  RETURN 

8930  REM**  FINDS  I SUCH  THAT  T(I)<=T<T(I+1 ) USING  FORWARD  SEARCH  ONLY  ** 
8940  IF  T[ 1+1 ] >=  T THEN  8990 
8950  1=1+1 

8960  IF  I <=  N+1  THEN  8940 
8970  PRINT  "RANGE"; 

8980  STOP 
8990  RETURN 
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9000  RUM  ************  DERIVATIVES********************* 

9010  REM  THIS  ROUTINE  CALCULATES  THE  ZEROTH  THROUGH  D1  DERIVATIVES  OF  SPLI' 
9020  REM  AND  DLACES  THEM  IN  P(l)  THROUGH  p(D1+l)  RESP. 

9030  REM  THE  DERIVATIVES  ARE  AT  T 
9040  D1 =3 

' 9050  H1=(T[N+ll-T[K])/60 
9060  FOR  T=T[K]  TO  T[N+l]  STEP  HI 
9070  I=K 
9080  GOSUB  8930 

9090  REM**********************  CALCULATES  N(l,K)  AT  T******************* 

9100  N[1,1]=1 

9110  FOR  3=1  TO  K-1 

9120  ?[S]=T[I+S]-T 

9130  M S]*W[I+1-S] 

9140  N[  ,3+1 ]=0 

9150  FOR  R= 1 TO  S 

9160  Z9=N[R, S]/ (P[R]+M[ 5+1 -R] ) 

9170  NrR,3+1 ]=N[R,S+1]+P[R]*Z9 

9180  N[R+1, 3+1 ]=M[S+1-R]*Z9 

9190  NEXT  R 

9200  NEXT  3 

9210  FOR  S=1  TO  K 

9220  D[S , 1 ]=A[l-S+1 ] 

9230  NEXT  S 
9240  FOR  S=2  TO  K 
9250  FOR  R=S  TO  K 

9260  D[R,5].(D[R-1,S-1]-B[R,S-1])/(T[I+K.R+1]-T[I-R+S])  ' 

9270  NEXT  R 
9280  NEXT  S 
9290  Ml =1 

9300  FOR  1=1  TO  D1+1 
9310  31=0 

9320  FOR  J=1  TO  K-I+1 

9330  S1=S1+D[K+1-J, I ]*N[ J,K+1 -I ] 

9340  NEXT  J 
9350  P[ 1 ]=S1*M1 
9360  Ml =M1* (K-I) 

9370  NEXT  I 

9380  PRINT  T,P[1],P[2],P[3] 

9390  NEXT  T 
9400  STOP 

9500  REM  ***  SPLINE  FUNCTION  FNS(T)  *********************************** 

9510  DEF  FNS(T) 

9520  I=K 
9530  GOSUB  8930 
9540  GOSUB  7550 
9550  RETURN  Cl 
9560  END 
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10  R.HM*******  CUBIC  SPLINE  PROGRAM  ***** 

20  REM  ****  SUBROUTINE  JUMP  ADDRESSES  ******************************** 


30  REM  (1)  SPLINE  GRAPH . 7000 

40  REM  (2)  ERROR  GRAPH 7380 

50  REM  (3)  DATA  POINT  PLOT 7290 

60  REM  (4)  RETURN  TO  KNOT  MOVING  ROUTINE  — 7640 

70  REM  (5)  DERIVATIVE  ROUTINE 9000 

80  DIM  B[30],E[30].X[50],G[50],H$[1],Q[30] 


90  DIM  N[6f6],T[30],P[30],M[30],D[6>6JfA[30],C[30],S[6] 
100  K=4 

110  REM***  H8  USED  IN  KNOT  MOVING  ROUTINE  *** 

120  H8=1 28 

130  PRINT  "CUBIC  SPLINE  PROGRAM" 

140  PRINT  "K  EQUALS "K 

150  DISJ  "NUMBER  OP  DATA  PTS"; 

160  INPUT  L9 

170  PRINT  »L9  EQUALS "L9 

180  DIS?  "V/ANT  TO  LOAD  DATA  FROM  TAPE"; 

190  INPUT  HQ 

200  IF  H$="Y"  THEN  300 

210  DISF'  "2  FILE  #S  WHERE  DATA  GOES"; 

220  INPUT  A8,A9 

230  FOR  1=1  TO  L9 

240  DISP  "INPUT  X,Y"I; 

250  INPUT  X[I],G[I] 

260  NEXT  I 

270  STORE  DATA  A8,X 
280  STORE  DATA  A9,G 
290  GOTO  370 

300  DISJ  "2  FILE  #S  WHERE  DATA  IS"; 

310  INPUT  A8,A9 
320  LOAD  DATA  A8,X 
330  LOAD  DATA  A9,G 
340  DISP  "PRINT  DATA"; 

350  INPUT  HQ 
360  IF  HQ#"Y"  THEN  400 
370  FOR  1=1  TO  L9 
380  PRINT  I,X  [I],G[I] 

390  NEXT  I 

400  DISP  "NUMBER  OF  EQUATIONS=?"; 

410  INPUT  N 

420  N 1 =N - 1 

425  N2=N-2 

430  PRINT  "N="N 

440  FOR  1=1  TO  N+K 

450  DISP  "KNOT  IN  T(I)=?"I; 

460  INPUT  T[I] 

470  PRINT  "T(I)="T[I],I 
480  NEXT  I 
490  GOSUB  550 
500  GOSUB  8100 

510  PRINT  "SUM  OF  SQUARES  ="S1 
520  DISP  "PLOT?"; 

530  WAIT  3000 
540  GOTO  7640 
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350  PJDM  *************SUBR0UTINE  EQUATE*****************************'******** 
560  FOR  11=1  TO  N 

570  B[I1 ]=C[I1 ]=P[I1 ]=M[I1 ]=Q[I1 ]=0 

620  NEXT  II 
630  I=K 
640  11=0 

650  FOR  L=1  TO  L9 

660  I?  I=N  THEN  710 

670  IF  X[L]<T[I+l]  THEN  710 

680  1=1+1 

690  II =I-X 

700  GOTO  660 

710  T=X[L] 

720  P1=T[l+1 1-T 
730  P2=T  1+2  -T 
740  P3=T[I+3‘-T 
750  Ml =T-T[l " 

760  M2=T-T "l-l " 

770  M3=T-T[ 1-2* 

780  Z9=1/(P1+M1) 

790  S1  = P1*Z9 
800  S2=M1*Z9 
810  Z9=31 / (P1+M2) 

820  31=P1*Z9 
830  C2=M2*Z9 
840  Z9=S2/(P2+M1) 

850  32=C?+P2*Z9 
860  33=M1*Z9 
870  Z9=81 / (P1+M3) 

880  S1=P1*Z9 
890  C2=M3*Z9 
900  29=32/ (P2+M2) 

910  32=C2+P2*Z9 
920  C3=M2*Z9 
930  Z9=S3/(P3+M1) 

940  S3=C3+P3*Z9 
950  S4=M1*Z9 
960  G1=G[L] 

970  12=11+1 

980  B[I21=S1*G1+B[I2' 

990  c[l2]=Sl*S1+C[l2J 
1000  P[I21=31*S2+P[I2' 

1010  M‘l2J*S1*S3+M’l2* 

1020  Q[I2J=31*S4+Q[I2‘ 

1030  12=12+1 

1040  B[I21=S2*G1+B[I2‘ 

1050  C "l2" =S2*S2+C ’l2" 

1060  P "12" =S2*S3+P "12 
1070  M[I2J=S2*S4+M[I2* 

1080  12=12+1 

1090  B[I2]=S3*G1+B[I2' 

1100  C ’12" =S3*S3+C  12 
1110  P[I2J=S3*S4+P[I2' 

1120  12=12+1 
1130  B[l2l=S4*G1+Bri2l 
1140  C[I2J=S4*S4+C[I2J 
1150  NEXT  L 
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1160  ROM  ***************  BANDED  MATRIX  ROUTINE  ************************ 

1170  REM  SOLVES  SYSTEM  USING  CHOLESKY'S  DECOMPOSITION  METHOD 

1100  FOR  J=1  TO  N-3 

1190  J1=J+1 

1200  J2=J+2 

1210  J3=J+3 

1220  E[j!=D=SQR(C[jl) 

1230  P[ j]=D1=P[j]/D 
1240  Mi  J*  =D2=M  * J /D 
1250  Q[ Jj=D3=Q[ Jj/D 
1260  C'JI ]=C[J1 1-D1*D1 
1270  P’jr=P'j1  -D1*D2 
1280  M’JI " =M’J1  -D1*D3 
1290  C " J2 " =C  *J2" -D2*D2 
1300  P[ J2" =P ' J2  -D2*D3 
1310  C[J3J=C[J3]-D3*D3 
1320  NEXT  J 

1340  D=S[N2 ]=SQR(C[N2 J ) 

1350  P[N2l=D1=prN2l /D 
1360  M‘N2'=D2=M[N2J/D 
1370  C N1 ' =C[N1 1-D1*D1 
1380  P'NI J=P[N1 J-D1*D2 
1390  C "n3=C[N]-D2*D2 
1400  E‘N1 ]=D=SQR(C[N1]) 

1410  P'NI J=D1=P[N1 J/D 
1420  E'n1=SQR(C[N]-D1*D1) 

1430  CV=B[1  J/E[1] 

1 440  C'2'=(B[2]-C[ll*P[ll)/E[2] 

1450  C[ 3 J= (B[ 3 j-M[ 1 J*C[ 1 J-P[2]*C[2] )/E[ 3] 

1460  FOR  J=4  TO  N 
1470  J1=J-1 
14  80  J2=J-2 
1490  J3=J"3 

1500  C[J1=(B[J]-M[J2]*C[J2]-P[J1]*C[J1]-Q[J3]*C[J3])/E[J] 

1510  NEXT  J 
1520  ArN]=C[N]/E[N] 

1530  A'NI ]=(C[N1 J-P[Nll*A[N])/ErNl1 

1540  A[N2 ]= (C[N2 J-P[N2 J*a[N1 J-M[N2 J*A[Nl )/E[N2] 

1550  FOR  J=N-3  TO  1 STEP  -I 
1560  J1=J+1 
1570  J2= J+2 
1580  J3=,J+3 

1590  A[J  !-(C[J]-P[j]*A[J1]-M[J1*A[J2]-Q[J]*A[J3])/E[J] 

1600  NEXT  J 
1610  RETURN 
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7000  REM*********SPLINE  GRAPH********************************************** 

7010  DISP  ”4  SCALE  VALUES”; 

7020  INPUT  Q9,Q8,Q7,Q6 
7030  SCALE  Q9,Q8,Q7,Q6 
7040  DISP  ”2  VALUES  X-CROSS  AND  XTIC” ; 

7050  INPUT  Q7,Q6 
7060  XAXIS  Q7.Q6 

7070  DISP  ”2  VALUES  FOR  Y-CROSS  AND  YTIC"; 

7080  INPUT  Q7,Q6 
7090  YAXIS  Q7,Q6 
7100  DISP  "STEP 
7110  INPUT  Q5 
7120  I=K 

7130  FOR  T=Q9  TO  Q8  STEP  Q5 
7140  GOSUB  8930 
7150  GOSUB  7540 
7160  PLOT  T,  Cl 
7170  NEXT  T 

7180  I=K  . . 

7190  FOR  Q6=K+1  TO  N 
7200  PEN 
7210  T=T[q6] 

7220  GOSUB  8930 
7230  GOSUB  7540 
7240  PLOT  T,C1 
7250  CPLOT  -0.3, -0.3 
7260  LABEL  (*)”0" 

7270  NEXT  Q6 
7280  STOP 

7290  REM  ******  DATA  POINT  PLOT  **************************************** 

7300  FOR  11=1  TO  L9 
7310  PLOT  X[I1],G[I1] 

7320  CPLOT  -0.3, -0.3 
7330  LABEL  (*)”X" 

7350  PEN 
7360  NEXT  II 
7370  STOP 

7380  REM  *****  ERROR  PLOT  ROUTINE  ************************************ 

7390  Y=3*SQR(S1/L9) 

400  SCALE  X[1 ],X[L9],-Y.Y 
7410  XAXIS  0, (X[L9]-X[1 J)/10 
7420  YAXIS  X[ 1 ] , Y/6 

7430  PRINT  "XTIC  =”(X[L9]-X[1 ])/10wYTIC«wY/6 
7440  I=K 

7450  FOR  J=1  TO  L9 
7460  T=X[ J ] 

7470  GOSUB  8930 
7480  GOSUB  7540 
7490  PLOT  T,G[J]-C1 
7495  PLOT  T, 0 
7500  PEN 
7510  NEXT  J 
7520  STOP 
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7550  REM  *******  CALCULATE  SPLINE  VALUE  FROM  SPLINE  COEFFICIENTS  ******* 

_ r*  . *1  , 


7540  PI  &T 
7550  P2=T 


7560  P3=T[I+3 


1+1 

1+2 


-T 

-T 


7570  M2=T-T[I-2J 


7580  M3=T-T 
7590  M4=T-T 


[if 


7600  C1=(M2*A[l-2l+P1*A[l-3l)/(M2+Pl} 

— ^ , r J 1 . TtAll  1 rj  2*  j ‘ ^ 


7610  C2=(M3*A '1-1 ]+P2*A[ 


M3+P2) 


615  C3=(M4*A[l]+P3*A[I-l])/(M4+P3) 
7620  C1=(M3*C2+P1*C1)/(M3+P1' 

7625  C2“(M4*C3+P2*C2)/(M4+P2 
7630  C1=(M4*C2+P1*C1)/(M4+P1 


7635  RETURN 

7640  REM*******  ********KN0T  MOVING  SUBROUTINE***** ************ ******  ***■» 

7680  FOR  15=5  TO  N 

7690  E1=(T[I5+1]-T[I5])/H8 

7700  A5=S1 

7710  X5=T[I5]  , ■ 

7720  T[I5J*X4«T[I5]+E1 

7730  GOSUB  8100 

7740  IF  S1<A5  THEN  7800  . 

7750  E1=(T[I5-1]-X5)/H8 

7760  T[I5]=X4=X5+E1 

7770  GOSUB  8100 

7780  IF  S1<A5  THEN  7800 

7790  GOTO  8020 

7800  A4=S1 

7810  E1-E1+E1 

7820  T[I5]=X3=X4+E1 

7830  IF  (X3<T[I5-1]  OR  X3>T[I5+l])  THEN  8020 
7840  GOSUB  8100 
7850  A3=S1 

7860  IF  (A3>A4)  THEN  7920 

7870  / 5=A4 

7880  X5=X4 

7890  A4=A3 

7900  X4=X3 

7910  GOTO  7810 

7920  X6=X3*X3 

7930  X7=X4*X4 

7940  X8=X5*X5 

79 50  R1 =A5* (X6-X7 ) +A4* (X8-X6) +A3* (X7-X8) 

7960  R2=(A5*(X3-X4)+A4*(X5-X3)+A3*(X4-X5))*2 

7970  T[I5]=R1/R2 

7980  GOSUB  8100 

7990  IF  (S1<A4)  THEN  8030 

8000  T[I5]=X4 

8010  GOTO  8030 

8020  T[I5]=T[I5]-E1 

8030  GOSUB  550 

8040  GOSUB  8100 

8050  PRINT  T[I5],S1 

8060  NEXT  15 

8070  DISP  "PLOT?"; 

8080  WAIT  3000 

«r»an  Ofvpn  7finn  _ _ 


8100  rem***********cgmfutes  sum  op  SQUARES***** ******************* 

8110  S1=0 
8120  I=K 

8130  FOR  L=1  TO  L9 
8140  T=X[L] 

8150  IF  T[I+l]  >=  T THEN  8200 
8160  1=1+1 

8170  IF  I <=  N+1  THEN  8150 
8180  PRINT  "RANGE'' 

8190  STOP 
8200  GOSUB  7550 
8210  S=C1 -G[L] 

8220  S1=S1+S*S 
8230  NEXT  L 
8240  RETURN 

8930  REM**  FINDS  I SUCH  THAT  T(l)<=T<T(I+1 ) USING  FORWARD  SEARCH  ONLY*** 
8940  IF  T[l+1 ] >*  T THEN  8990 
8950  1=1+1 

8960  IF  I <=  N+1  THEN  8940 
8970  PRINT  "RANGE"; 

8980  STOP 
8990  RETURN 


9000  REM  ************  DERIVATIVES********************* 

9010  REM  THIS  ROUTINE  CALCULATES  THE  ZEROTH  THROUGH  D1  DERIVATIVES  OF 
9020  REM  AND  PLACES  THEM  IN  P(l)  THROUGH  p(D1+1)  RESP.  SPU^ 

9030  REM  THE  DERIVATIVES  ARE  AT  T 
9040  D1=3 

9050  H1=(T[N+1 ]-T[K])/60 

9060  FOR  T=T[K ] TO  T[N+1]  STEP  HI 

9070  I=K 

9080  GOSUB  8930 

9090  REM** ********************  CALCULATES  N(I,K)  AT  T *************** 

9100  N[l',l]=1 

9110  FOR  S=1  TO  K-1  v 

9120  P[S]=T[I+S]-T 
9130  M[S>T-T[I+1-S] 

91 40  N[ 1 , S+1 ]-0 

9150  FOR  R=1  TO  S 

9160  Z9=N[R,S]/(P[R]+M[S+1-R]) 

9170  N[R,S+1]=N[R.S+1]+P[R]*Z9 

9180  n[r+1,S+1]=M[S+1-R]*Z9 

9190  NEXT  R 

9200  NEXT  S 

9210  FOR  S=1  TO  X 

9220  D[S, 1 ]=A[l-S+1 ] 

9230  NEXT  S 
9240  FOR  S=2  TO  K 
9250  FOR  R=S  TO  K 

9260  D[R, S ]= (D[R-1 ,S-1 ]-D[R,S-1 ]) / (T[l+K-R+1 ]-T[I-R+S]) 

9270  NEXT  R 
9280  NEXT  S 
9290  Ml =1 

9300  FOR  1=1  TO  D1+1 
9310  S1=0 

9320  FOR  J=1  TO  K-I+1 

9330  S1=S1+D[K+1-J,I]*N[J,K+1-I] 

9340  NEXT  J 
9350  P[I]=S1*M1 
9360  Ml =M1* (K-I ) 

9370  NEXT  I 

9380  PRINT  T,P[l],P[2]fP[3] 

9390  NEXT  T 

9500  REM  ***  SPLINE  FUNCTION  FNS(T)  *********************************** 

9510  DEF  FNS(T)  ' 

9520  I=K 

9530  GOSUB  8930 

9540  GOSUB  7540 

9550  RETURN  Cl 

9560  END 
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SOME  NOVEL  ROOTKlNDIffG  METHODS 
Charles  E.Gray 

Aeronutronic  Ford  Corporation 
Palo  Alto, California 


ABSTRACT 


This  paper  develops  some  methods  for  computing  the  real  roots  of 
a real  valued  function  in  a finite  interval.  This  classical  problem  is 
treated  by  methods  that  are  novel  in  the  following  respects:  the  con- 

vergence does  not  depend  on  the  customary  "initial  guess;"  one  can  con- 
clude that  the  function  has  no  roots,  if  that  be  the  case;  if  that  is 
not  the  case,  one  can  compute  all  real  roots  of  the  function  in  the  in- 
terval to  within  a preassigned  accuracy.  It  is  assumed  that  the  func- 
tion is  Lipschitz  and  that  a value  for  the  Lipschitz  constant  is  known. 
These  assumptions  arc  mild  enough  so  that  methods  are  applicable  to  a 
variety  of  problems,  notably  in  optimal  control.  From  the  basic  method 
two  other  algorithms  are  derived:  one  computes  the  maximum  of  a func- 

tion on  an  interval;  the  other  computes  the  roots  of  a function,  but 
with  improved  speed  of  convergence  similar  to  that  of  Newton’s  method. 
Numerical  results  are  presented  which  illustrate  the  properties  dis- 
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cussed  above. 


Introduc  tion 


A classical  problem  in  numerical  analysis  is  that  of  finding  in  a 
given  interval  (which  we  assume  to  be  closed  and  finite)  a real  root 
of  a continuous,  real-valued  function.  Many  algorithms  have  been  proposed 
to  solve  this  problem  ^[l  ]>  ],[  3]  , [ 4 ],[  5]  ,|  6 ] j. 

In  one  way  or  another,  they  all  require  an  initial  guess  at  the  value 
of  the  root;  it  is  hoped  that  this  guess  is  improved  by  an  iterative 
application  of  the  algorithm.  This  hope  may  be  disappointed  for  a number 
of  reasons.  It  may  happen  that  an  initial  guess  appropriate  to  the  algo- 
rithm does  not  exist,  even  though  a root  exists.  Or,  if  an  initial  guess 
and  a root  both  exist,  the  algorithm  may  nevertheless  fail  to  converge, 
because  the  initial  guess  is  not  close  enough  to  the  root.  Of  course,  if 
there  are  no  roots,  no  convergence  is  possible.  Unfortunately,  it  is  not 
possible  to  determine  whether  lack  of  convergence  is  due  to  the  first  or  the 
second  of  these  alternatives.  The  only  recourse  then  is:  if  at  first  you 

don't  converge,  pick  a new  initial  guess  and  try,  try  again. 

By  contrast,  the  method  which  we  present  below  (first  proposed  in 
does  not  require  an  initial  guess  at  the  value  of  the  root.  If  there  is  no 
root  of  the  function  in  the  given  interval,  the  algorithm  reveals  this  fact. 
Otherwise,  the  algorithm  computes  all  roots  of  the  function  in  the  interval* 
To  illustrate  the  problems  that  can  beset  classical  methods,  we  consider 
two  of  them;  the  bisection  method  and  Newton's  method.  We  denote  the  func- 
tion whose  roots  are  sought  by  f(x)  and  the  interval  of  interest  by  |a,bj. 
The  problem  then  is  to  find  an  x e [a,b]  such  that 

f(x)  = 0 
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The  Bisection  Method 


In  the  bisection  method  it  is  assumed  that  we  know  two  numbers  uq € [a,b] 
and  vq£  [a,b],  such  that  f ( u ) and  f (v^ ) are  of  opposite  sign. 

Since  f (x)  is  continuous,  there  is  at  least  one  x6  [uo>v0]  suCh  that 
f (x)  = 0,  (See  p'igure  1.)  The  bisection  method  will  compute  one  root, 
as  foil ows : 


Step 

0 

Guess 

u ,v  such  that  f 

0 0 

<Uo> 

. f ( v ) < 0-  Set  i 
v 0 

Step 

1 

Set  x 

. = 1/2 
l 

(u.  + v.) 
V 1 X 

- If 

f (xd  * 

f (u.)<0 

set 

Vi+i^xi 

Go 

to  Step  2 

- If 

f (X.)  f 

(u.)  = 0 

Stop  * 

x*  is  the  solution, 

l 

- If 

f (x.)  f 

1 

<“i>  > 0 

Set 

ui+i^xt 

Go  to  Step  2 

Step  2 Set  i* — iTl  and  go  to  Step  1 


However,  it  may  happen  that  there  are  no  u and  v ; in  the  example 

o o 

of  Figure  2,  f (x)>0  V x € ra,bl  hence  f (u  ) f (v  ) > 0 

L J o o 

V U C [a,b]  . v e [a,b] 

o o 

Newton’s  Method 

In  Newton’s  method,  it  is  assumed  that  an  initial  guess  XQ  is  given. 


This  guess  is  then  updated  according  to  the  formula 


f(x.) 


Like  the  little  girl,  who  had  a little  curl,  right  in  the  middle  of 
her  forehead,  when  Newton's  method  is  good,  it  is  very,  very  good,  but  when 
it  is  bad  it  is  horrid  (See  Figures  4,  5,  6). 
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A New  Foot  Finding  Method* 


Concept 

Suppose  that  f (x)  Is  Lipschitz  on  [a,b]  * Thus  for  some  M1: 
|f  (Xl)  - f (x2)  | < M’  |Xl  - x2|  V *lf  x2  £ [a,b] 

Assume  that  M1  is  known  and  consider  the  following  algorithm: 


Al£°£iJ-hm 

Step  0 Set  i*— 0,  a. 

Step  1 Compute  f (x  i )* 

Go  to  Step  2 * 

|f(x)| 

Step  2 Set  x£  + *— ft.  *1+1 

Go  to  Step  3 

Step  3 If  x > b,  stop* 

Otherwise  set  i« — - i+1  and  go  to  Step  1* 


The  geometric  interpretation  of  the  algorithm  is  the  following  (cf. 
Figure  7) : 

To  obtain  x^_^,  araw  frqrn  the  point  with  coordinates  (x^,  f(x.))  the 

line  L whose  slope  is  M1  in  magnitude  and  which  intersects  the  x-axis  to  the 
right  of  x^;  the  point  of  intersection  is  x^+^. 


Convergence  Propg^r^tie^s 

The  algorithm  generates  a monotonically  increasing  sequence  j x^  | - If 
this  sequence  is  finite,  then  the  algorithm  stops  in  Step  3 with  x^,  > b* 

If  the  sequence  is  infinite  j j then  it  har?  a limit  in  [a,b]  , 

say  x*  (because  |x^|^q  a sequence  bounded  from  above  by  b)  . 


Then: 


lim  x 

Xi+1 

i — > oo 


lira  = x* 

i ► 00 
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Because  f (x)  is  Lipschitz,  it:  is  continuous  so  that 

lim  I f (x.) [ = I f (x*) f 

00  » 1 1 1 

From  Step  2 of  the  algorithm  we  conclude 

1 


* * 
x - x + 


M' 


f (X  ) 


Hence : * 

f (x  ) = 0 

Thus  the  algorithm  e it her  converges  to  a root  in  an  infinite  number 
of  steps  or  it  reaches  the  end  of  the  interval  ( > b)  in  a finite 

number  of  steps  without  computing  a root.  Note  that  this  conclusion  holds 
for  any  positive  value  of  N[ ' . 

But  M r is  a Lipschitz  constant.  This  key  fact  allows  us  to  strengthen 
our  conclusions,  which  we  now  state  in  the  form  of  a theorem. 


Theorem 

Let  f (x)  be  a real  valued  function  on  [a,b]  such  that 

J f (xp  - f (x2)|  < m'  j X1"X2IV  X1>  x2  f £a*b] 

Consider  a sequence Jx  .{  generated  by  applying  Algorithm  1 to  f (x) . 

I L)  L~~  U 

Then  either 

(1)  The  sequence  is  finite,  in  which  case  f (x)  has  no  roots  in  [a,b] 
or 

(2)  The  sequence  is  infinite,  in  which  case  it  converges  to  the 

smallest  x £ [a,b]  such  that  f (x)  “ 0. 


Geometrical  Sketch  of  Proof 

Refer  to  Figure  7*  Since  m1  is  a Lipschitz  constant,  it  is  greater 
than  the  magnitude  of  the  slope  of  f (x)  anywhere  on  [a,b]  . Hence  the 
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line  L,  so  to  speak5  goes  to  zero  faster  than  the  function*  Consequently 
if  f (xt)  > 0,  Chen  f (x)  > 0 V x € Jx^x^J 

Therefore  if  f (a)  > 0 then  f (x)>0  V C [a.xj  for  every  i = 1,  2 

If  the  sequence  x^  "jams  up"  at  x € [a,bj  then  f (x  ) =0  while 
f (x)>  0 V x € [a,x*j  . Otherwise,  f (x)  > 0 V x € [a,b]  as  in  Figure  8. 
For  a formal  proof  see  the  appendix. 


■Realization 

inJ“r£^HFJL\2.n 

Algorithm  1 allows  us  to  determine  the  first  root  of  f (x)  to 

* 

the  right  of  x = a,  say  ; by  applying  Algorithm  1 again  we  can 

compute  the  first  root  to  the  right  of  x^  , say  x*,  i.e.,  the  second  root 
to  the  right  of  x = a and  so  on  until  all  the  roots  of  f (x)  in  [a,b] 
have  been  computed. 

Note  that,  as  stated, Algorithm  1 requires  in  general  an  infinite 
number  of  steps  to  compute  , and  therefore  will  not  be  able  to  compute 

k 

, in  finite  time*  In  order  to  complete  the  computation  in  finite  time, 
we  must  modify  Algorithm  X;  the  result  is  Algorithm  2,  which  we  call  ROOT- 


FINDER. 


Algorithm  2 is  obtained  by  relaxing  the  requirement  that  we  com- 
pute the  set  SQ=  |x^[a,b]  j f (x)  = 0 j to  the  requirement  that  we  » 

compute  some  set  of  the  form  = |x£[a,b]  ||f(x)  | < £ I where  € 

is  a preassigned  number.  Clearly  contains  Sq,  Also,  as  e — * 0, 

S£  — ► S q so  that  is  an  approximation  to  the  set  of  roots  of  f (x)  , 

SqJ  the  closeness  of  the  approximation  €,  can  be  preassigned,  (See  Figure  11) 
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Algorithm  2;  ROOTFTKDFK 


Step  0 

Set  N£*  -1,  i«”0*  Nr-  0, 

Step  1 

Compute  f (x^) 

StfiP-J. 

If  | f (x^  J < c go  to  Step  3; 

Otherwise  go  to  Step  6, 

aamJ. 

If  N =1  go  to  Step  4 ; 

C 

Otherwise  go  to  Step  5, 

Step  4 

Set  £ j ■*" — Xj  t go  to  Step  5 . 

Steje_5 

Set  Ne*-0,  h* ”1 — , Nk  - 

Go  to  Step  9. 

Step  6 

If  ■ 1,  go  to  Step  3; 

Otherwise  go  to  Step  7. 

Step  7 

Set  J 4— — j+] 

Go  to  Step  8 . 

Step  8 

Set  N > *■ — 1 , 4* 

£ Hf 

Go  to  Step  9- 

Step  9 

Set  x . +—  x.  + h 

l+l  l 

i + - - i + 1 

Step  10 

If  < b go  to  Step  1 ; 

Otherwise  go  to  Step  11  * 

Step  11 

If  N£  * 1 go  to  Step  13; 

Otherwise  go  to  Step  12  . 

Step  12 

Set  b H j J + 1 — * j , go  to  3 t. 

Step  13 

If  Nr  “ 0,  repor  t "Thei  e are  no 

set  repo  t;  f,Roo  s 1 io  i.  the  intervals  ^ 

and  stop* 


0 be  given 


Otherwise, 
i J " 
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Dl  srus^lop 

Referring  to  the  flow  chart  oi  Figure  > we  see  that  is  updated 

in  one  of  two  ways: 

1)  If  j £ (x.)  J > « then 

+ kj 


Xi+1  “ Xi 


M' 


as  in  Algorithm  1.  The  step  size  Is  variable,  being 

proportional  to  j f(xj.)  J. 

2)  If  Kyi*  then 

"i+I  ' *1  + -gr-  . 

In  this  case  the  step  size  xi+i_Ki  constant* 

The  flag  takes  on  the  value  0 if  at  the  previous  , | f (x^)  J < 6 

and  takes  on  the  value  1 otherwise* 

To  understand  algorithm  2 refer  to  Figure  10  which  depicts  the  function 
f (x)  in  the  vicinity  of  a .zero  crossing*  The  path  followed  through  the 
algorithm  is  analyzed  in  Table  1* 

The  key  points  are  the  following: 

1)  If  Nj  s 1 then,  because  of  the  property  of  the  variable  step  size 

of  Algorithm  1 the  function  does  not  vanish  in  [ ^ , xiJ4 

2)  If  N£  = 0,  xL  is  incremented  by  the  constant  step  size  which  is 

chosen  small  enough  so  that  in  one  step  | f(x)  J cannot  exceed  € (due 
to  the  bound  on  the  slope  of  | £(x)  | )_ 

3)  If  s 1 and  J f (x)  | < £ the  algorithm  enters  an  interval 

f £ j » TJ  j j where  by  2)  above  j f (x)  J does  not  exceed  6 . 

4)  If  6 0 and  | f (x)  j > £ the  algorithm  leaves  an  interval 

f £ y H jj  where  I t (x)  j does  not  exceed  € „ 
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Remark : 

Considering  the  simplicity  of  the  concept  underlying  R00TF1NDER  and 
GNEWTON , it  seems  remarkable  that  these  algorithms  have  remained  undis- 
covered until  now,  yet  such  seems  to  be  the  case.  This  is  not  to  say  there 
haven 1 1 been  some  near  misses. 

Thus,  for  example,  Milne  [2]  discusses  algorithms  of  the  form 

X.  = X. f (Xj) 

l+l  1 m K V 

Various  choices  for  m are  considered  but  the  choice  of  a Lipschitz  con- 
stant is  not  among  them.  Also,  the  use  of  the  absolute  value  of  the  second 
term  on  the  right  hand  side  is  not  considered;  this  would  ensure  that  the 
search  always  proceeds  to  the  right,  as  in  ROOTFINDER. 


99 


In_trod  u c ion 

Algorithm  2 can  be  modified  to  find  approximately  the  maximum  of  the 
function  f(x)  on  [a,b].  The  idea  is  to  find  the  zeros  of  f(x)~c, 
where  c is  a constant;  then,  by  some  search  procedure,  increase  c until 
the  algorithm  indicates  that  f(x)-c  is  non-positive  but  not  strictly  neg- 
ative on  [a,b]  . 

Algorithm  3 which  we  call  MAXFINDER  described  below,  computes  a 
value  c + € which  exceeds  the  maximum  of  f(x)  on  [a,b]  by  no  more  than 
£ (where  € is  a preassigned  number)  together  with  intervals  [£  j , 7] 
where  f(x)  lies  in  the  band  c-6  , c+C  (See  Figure  12)  « 


Algorithm  3: 


Step  0 


Step  1 


Step  2 


Step  3 


Step  4 


Step  5 


Step  7 


Step  8 


Step  9 


Step  10 


Step  11 


Step  12 


KAXFICTER 

Set  0 — ^ WNf  , O^i  1 j a | 

go  to  Step  1* 

Compute  f (x^) . Go  to  Step  2* 

If  g0  to  Step  3* 

Otherwise  go  to  Step  4. 

Set  f (x^)  ^ c 1 — ^ Go  to  Step  4. 

If  f (xp  < c + € Go  to  Step  8. 

Otherwise,  go  to  Step  5. 

Set  f (x.)  4 c 1 S-  j*  • Go  to  Step  6. 

Set  x,  £ Go  to  Step  7, 

Set  € h,  0 N Go  to  Step  13* 

M 1 

If  f (x  ) > c - go  to  Step  9* 

Otherwise,  go  to  Step  10* 

If  N “ 0S  go  to  Step  7. 

Otherwise  go  to  Step  6. 

If  ~ 0,  go  to  Step  11* 

Otherwise  go  to  Step  12, 

Set  1 ->  Nf  , xi_1  rjj,  j+1  -»  j 
Go  to  Step  12. 

Set  c ~ f -4  h.  Go  to  Step  13. 

M 1 


Step  13  Set  + h ^ i + 1 — > i Go  to  Step  14. 

Step  14  If  x,  > b , go  to  Step  15*  Otherwise  go  to  Step  1. 

Step  15  If  = 0 , go  to  Step  16.  Otherwise  go  to  Step  17, 
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Step  16  Set  b T) ^ , j+1  j.  Go  to  Step  17. 

Step  17  Set  j-1  ->  J. 


Report: 


c - € < Max  f (x)  < c -f  € 

c - £ < f (x)  < c + € in  intervalsj 


Stop. 
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Discussion 


Algorithm  3 can  be  understood  by  referring  to  the  flow  chart  of  Figure  14  and 

to  Figure  15,  which  represents  a sample  f(x)  to  be  maximized. 

As  long  as  f(x)  increases  the  algorithm  steps  along  in  steps  of  — ^ , 

M 


updating  c to  f(x^)  whenever  f(x^)  exceeds  the  previous  c by  more  than 


€ ; at  the  same  time  £ . is  updated  to  x^.  When  f(x^)  starts  to  decrease, 

the  value  of  at  which  the  decrease  is  noted,  i.e.,  the  previous  to  the  cue 

such  that  f (x^)  - c < - f , is  retained  as  rjj*  The  ciirrent  approximation 


to  the  maximum  is  then  the  current  c while  the  location  of  the  n 


maximum  is 


approximated  by  £ . , p , J . 


When  f(x)  decreases,  the  algorithm  steps  along  in  variable  steps  pro* 
portional  to  c - f(x)  in  the  manner  of  Algorithm  1. 

1 2 

In  Figure  15,  successive  approximations  to  the  maximum  are  c , c , . <, . . 
c^.  After  c6  the  function  starts  to  decrease  so  c^  is  taken  to  be  c,  the 
approximation  to  the  maximum.  Likewise,  £ is  taken  to  be  the  approximation 
to  the  left  hand  end  of  the  6-  interval  around  the  maximum.  When  f(xp  < 
c - € , *s  taken  to  be  D ^ and  the  algorithm  starts  stepping  in  vari- 

able  steps  proportional  to  c-f(x)  in  the  manner  of  Algorithm  1.  The  pro- 
cedure may  be  likened  to  plotting  £(x)  on  the  side  of  an  aquarium,  then 
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filling  the  aquarium  with  water  until  the  maximum  of  the  curve  f(x)  just 
touches  the  surface  of  the  water.  Notice  that  this  is  a direct  search, 
which  yields  the  global  maximum*  This  is  in  contrast  to  the  method  of 
searching  for  the  set  of  roots  of  the  derivative  of  f (x) , which  may  or 
may  not  contain  the  global  maximum  (See  Figure  13)* 

Acceleration  of  Convergence 

Introduction 

One  of  the  virtues  of  Newton's  method  is  that  it  exhibits  very  rapid 
convergence  when  f,close  enough'1  to  a root  of  f(x).  It  is  possible  to  com- 
bine Algorithm  2 and  Algorithm  3 to  give  a new  algorithm.  Algorithm  4,  which 
we  call  GNEWTON,  which  has  the  convergence  properties  of  Newton's  method  in 
the  vicinity  of  a root  but,  in  the  manner  of  Algorithm  2,  computes  all  roots 
of  f(x)  in  the  interval  [a,b]  (or  indicates  that  there  are  none)  without 
requiring  an  initial  guess.  This,  Newton's  method  cannot  do. 

Algorithm  4 is  based  on  two  ideas:  the  first  is  to  introduce  an 

iteration  on  € : the  interval  [a,b]  is  scanned  by  ROOTFINDER  with  a coarse 

value  of  £,  ROOTFINDER  computes  £ -intervals  [ x^,  ] [ x^>  J , 

Then  £ is  refined  so  that^say^new  £ - £*  < old  e.  As  we  saw  above 

f(x)  =j:  0 outside  of  [x^,  y^J...^x^,  y^J  , so  on  the  next  pass  ROOTFINDER 
need  only  scan  [x^,  y^j  , ...»  y^]  * The  algorithm  then  computes  £f  - 


» 

k 

vals  are  scanned  and  so  on  until  £ is  less  than  some  preassigned  £ , 
Significant  improvements  in  computational  efficiency  result  when  this  idea 
is  combined  with  the  following  one:  prior  to  scanning  an  £ -interval 

[Xr»  y^],with  ROOTFINDER,  MAXFINDER  is  used  to  estimate  the  maximum  value 
of  the  slope  of  f(x)  on  [ x^,  y^  ] . The  result  is  then  used  as  M1  in 

ROOTFINDER. 


Again  e is  refined,  the  £ inter** 
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Algorithm  4:  GNEWTON 

Step  0 Set  a — ►x,  b — *y.  Choose  6*  and  € ->  6*.  Set  1 — *K — ►k. 
Step  1 Apply  MAXFINDER  to  the  function  f*(x)  on  the  interval [x,yj 
to  compute  Kf « 

Step  2 Using  M*  computed  in  Step  1,  apply  ROOTFINDER  to  f(x)  on 
the  interval  [x,yj. 


[V’jJl  1 


in  Stack  B 


Store  the  resulting  6-intervals 
(Number  these  sequentially  as  they  are  computed). 

Step  3 If  k < K go  to  Step  6.  Otherwise  go  to  Step  4. 

Step  4 If  6 < £ * Report  results. 

Otherwise  go  to  Step  5 , 

Step  5 Refine  6 # 

Note  K,  the  total  number  of  6 'intervals  just  computed 


Transfer 


f 


I 


ik  k 


the  6-intervals  just  computed,  from 


Stack  B to  Stack  A,  which  contains  the  intervals  to  be  scanned 


on  the  next  pass;  label  them 
Clear  Stack  B . 

Set  1 ►k 

Go  to  Step  7. 

Step  6 Set  k + 1 ►k*  Go  to  Step  7. 

Step  7 Set  ►x  * y. 

Go  to  Step  1. 


[xi>yi]  > — [vyK]  • 


Discussion 

Algorithm  4 can  be  understood  by  referring  to  the  flowchart  of  Figure  16 
and  the  discussion  in  the  introduction.  Some  further  discussion  of  the  entries 
in  Stack  B is  appropriate:  the  intervals  [^j  ^ ^j  j are  numbered  sequentially 

in  the  order  in  which  they  are  computed.  For  example,  suppose  that  in  scanning 


105 


Ll*  three  intervals  are  computed;  these  are  numbered  1,  2,  3 from  left 

to  right.  Then  in  scanning  |x2 , , two  more  intervals  are  computed;  these 

would  then  be  numbered  4 and  5;  and  so  on.  An  example  is  given  in  Figure  17. 


Com ^ rgen c e P r o p£ r jt i es 

As  mentioned  before,  when  close  enough  to  a root,  Newton fs  method  ex- 
hibits very  rapid  convergence.  To  see  why  this  is  so,  let  us  consider 
Newton's  iteration  formula  in  the  vicinity  of  a simple  root  x . 


x.  ,i  “ x,  - 

r+1  i 


f (*t) 


Using  Taylor's  expansion  around  x for  ffx^)  and  f'(x^)  there  follows: 

* * f (**)  + f ' (x*)  ( Xj-X*)+  O ((X.-X*)2) 

X.  ,«  * X = X * - x 

1+J-  1 f'(x*)  +o  (xt-x*) 


where  o (*)  denotes  any  function  such  that: 


lim 
Ct-*  0 


Noting  that  f(x*)  = o and  f % (x*)  f 0 (because  x*  is  a simple  root)  there 
follows: 

xi4l-x*  = o ((Xj-X*)2) 

Hence  the  speed  of  convergence  [ 7 ] of  the  method  is  at  least  quadratic « 
Notice  that  the  same  result  would  be  obtained  with  any  iteration  scheme 
of  the  form: 

foo 

X.  = X.  - 1 

1+1  1 0(X.) 

r 

provided  only  that: 

0 (x±>  = f'(x*)  + O (xt-x*) 


106 


In  fact,  ONE VJ TON  appears  to  be  just  such  an  Iteration  scheme.  This 
is  suggested  by  the  following  argument.  Consider  (for 

simplicity)  the  case  of  a single,  simple  root  x of  f(x)*  Denote  by 
the  successive  £ -intervals  around  it  computed  by  G NEWTON  and  by 
m*  aW  ’)  the  corresponding  value  of  M1  computed  by  MAXFINDER.  Then: 

Max  f'  (x)  - € < M'  ( i;  1,  ril  ) < Max  f'(x)  + € 


If  the  sequence  € converges  to  0,  we  obtain 

M'  (£*,  r?1)  = | £ ' (X*)  j + o ( £ 1 - x*) 

It  then  seems  plausible  to  assert  that,  for  small  J £ 1 - x*  j the 
iteration  scheme  is  of  the  form  : 


£ 


i-fl 


If  liiil 

M'(£\  ^ 


4 1 L L ih 

f ' (x^)  + O ( £ ^-x  ') 


which,  by  what  was  said  above,  assures  Newton  convergence.  This  con- 
jecture is  supported  by  the  numerical  behavior  of  GNEWTON , as  will  be 
discussed  below* 


Discussion 

Comparison  of  assumptions 

Let  us  compar  '*  first  the  assumptions  made  in  using  bisection  and  in 
using  ROOTFINDER.  In  both  cases  f(x)  is  assumed  to  be  continuous.  In 
addition,  ROOTFINDER  assumes  that  the  slope  of  f(x)  is  bounded  and  that  a 
bound  M * is  known,  while  the  bisection  method  assumes  that  f(x)  changes 
sign  on  [a,b]  - 

Among  the  questions  that  we  might  ask  at  this  point  are  the  following. 
Are  these  assumptions  unduly  restrictive?  Are  they  easy  to  verify?  These 
are  rather  vague  questions,  so  the  answers  will  likewise  be  somewhat  vague* 
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* 


In  many  applications  the  functions  of  interest  are  continuous.  The 
class  of  band  limited  functions  * for  example,  are  not  only  continuous  but 
have  bounded  slope  and  the  maximum  value  of  the  slope  can  be  predicted  on 
the  basis  of  the  bandwidth.  In  other  cases,  the  assumptions  can  be  verified 
by  inspection  of  the  analytical  expression  for  f(x)  together  with  a few 
rough  numerical  calculations* 

It  is  worth  noting  that  the  assumptions  underlying  ROOTFINDER  buy  some- 
what more  than  those  underlying  bisection,  since  ROOTFINDER  provides  informa- 
tion about  all  the  roots  of  f(x),  including  none  if  there  are  none* 

If  we  now  compare  GNEWTON  and  NEWTON,  we  find  that  both  assume 
that  f(x)  is  continuously  differentiable.  In  addition  GNEWTON  assumes 
that  the  slope  of  | £ * (>^)  j is  bounded  and  that  a bound  M11  is  known*  This 
additional  assumption  buys,  as  above,  information  about  all  the  roots  of  f (x) . 

Since  these  assumptions  are  stronger  than  those  made  for  ROOTFINDER  or 
bisection,  the  class  of  functions  to  which  GNEWTON  or  NEWTON  can  be  applied 
is  more  restricted.  This  class  is  by  no  means  empty,  as  the  following  non- 
trivial example  will  show. 

In  solving  a certain  linear  optimal  control  problem  ( [?]  , [9]  ) it  is 

necessary  to  find  the  roots  in  [a,b]  of  a function  f(x)  of  the  form 

f(x)  = <f>  (x)  - /J 

where  yj  is  a constant  and  <f>  (x)  is  a given  function  of  the  form: 

<f>  (x)  . V aL  e Aix  real,  + 0 ) 

i»l 

Thus,  the  derivative  0*  (x)  is  given  by: 

n A-  x 

0'<X)  * 2 a£  \L  e 1 

1=1 
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The  upper  bound  Mm  on  the  slope  of  0'can  be  crudely  estimated  from 
the  second  derivative  0 (x) : 

ii  n 2 

<t>  (*)  “ - ai  e 

i-1 

Such  an  estimate  is: 


M 


„ Aia 

n 1 l 

2 e 

- hi 

Ai  X k.b 

| Ai 

i“l 

if  Ai  < 0 

(Note  that 

if  ^ i > 0 a<b) 


This  problem  must  be  solved  for  a sequence  of  values  of  /?  , so  that  it; 
is  important  to  have  the  fastest  convergence  possible.  It  is  also  important 
to  compute  all  the  roots  of  f(x)  in  [a,b] . Thus,  GNEWTON  appears  to  be 
made  to  order  for  this  application. 


Comparison  of  Numerical  Performance 

Newton's  method,  GNEWTON  and  ROOTF INDER  were  applied  to  the  problem  of 
finding  the  roots  of  f(x)  on  [a,b]  for  the  particular  case  where: 

f(x)  = k 4-  sin  2 77  x 
[a,b]  = [O,  l] 

The  constant  k was  chosen  successively  to  be  0,  1 and  2,  thus  illus- 
trating the  case  \ here  f(x)  has  several  simple  roots,  one  double  root  and  no 
roots  at  all  (See  Figure  IS) . 

The  termination  criterion  was 

I f(x)  | < £ 

-2  -4  -6 

Three  values  of  € were  used,  10  ,10  ,10 

Also,  to  illustrate  the  effect  of  overestimating  Mr  (for  ROOTF INDER)  or 
M"  (for  GNEWTON)  three  values  were  considered: 
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M'  = M"  = Mq  = 2 7T 
M'  = M"  = 5 M 

o 

M*  ■ M"  = 10  M 
o 

Here  Mq  is  the  optimum  value  of  the  Lipschitz  constant  M*  or  M'\ 

The  measure  of  efficiency  of  a method  was  taken  to  be  the  total  number 
of  function  evaluations  of  f(x)  and  f 1 (x)  required  to  meet  the  termination 
criterion. 

To  compare  two  methods,  the  relative  efficiency  was  computed  by  divid- 
ing the  number  of  function  evaluations  for  one  method  by  the  number  of  func- 
tion evaluations  for  the  other.  If  one  method  could  not  converge  (as  for 
example,  when  the  bisection  method  was  applied  to  a non-negative  function) 
the  number  of  steps  was  taken  to  be  infinite. 

The  results  are  presented  in  Tables  III,  IV,  V,  VI,  and  VII,  The  data 
on  which  they  are  based  is  presented  in  Table  II, 

Table  III  compares  the  performance  of  GNEWTON  and  ROOTFINDER,  Because 
of  the  overhead  involved  in  evaluating  the  slope  GNEWTON  is  less  efficient 
than  ROOTFINDER  when  M*  is  equal  to  the  value  of  the  slope  of  f(x)  at  the 
root  (M1  = 2 7r?f  (x)  - sin  2 7tx  i.e,  k = 0)  . 

As  M1  increases  ROOTFINDER  takes  smaller  and  smaller  steps  and  its  ef- 
ficiency decreases.  As  Mn  increases  the  efficiency  of  the  slope  estimations 
in  GNEWTON  decreases  because  of  the  smaller  steps;  however,  the  improved  slope 
estimates  accelerate  the  zero  searching  phase  of  GNEWTON  so  that  overall,  its 
efficiency  relative  to  ROOTFINDER  increases. 

Likewise,  in  the  case  (M*  =2  rt , f(x)  = 1 + sin  2ttx,  i,  e,  k=  1)  the 
value  of  M1  is  much  greater  than  the  value  of  the  slope  at  the  root  (which 


no 


is  0)*  Thus  ROOTFINDER  must  take  small  steps  to  avoid  overstepping  the 
root , while  G NEWTON  can  adapt  its  step  size*  The  result  is  that  GNEWTON 
becomes  significantly  more  efficient* 

Tables  IV  and  V show  the  degradation  in  efficiency  of  the  algorithms  as  a 
function  of  M1  or  Mm  relative  to  their  own  performance  when  m*  or  m"  are 
chosen  optimally  (M*  = Mn  = 2 77 ) . The  degradation  of  GNEWTON  is  slight, 
while  ROOTFINDER  degrades  by  an  order  of  magnitude  when  M*  is  overestimated 
by  an  order  of  magnitude. 

Table  VT  compares  ROOTFINDER  to  NEWTON.  When  M1  is  well  chosen  (M*  - 277) 
and  the  slope  of  the  function  near  the  root  is  close  to  M*  (k  = 0)  ROOTFINDER 
is  only  slightly  less  efficient  than  NEOTON*  As  these  conditions  are  departed 
from,  its  efficiency  decreases.  (The  table  illustrates  only  the  result  of  varying  k)  * 
However,  for  k>l  NEWTON  does  not  converge  while  ROOTFINDER  does,  so  its 
relative  efficiency  is  infinite. 

Table  VI  also  compares  ROOTFINDER  to  BISECTION.  It  shows  that  ROOTFINDER , is 
uniformly  more  efficient  (i*e.  for  every  k and  € , when  M1  “ 2 77). 

Table  VII  similarly  compares  GNEWTON  to  NEWTON  and  BISECTION* 

The  method  is  somewhat  less  efficient  than  either  of  these  methods  except 
in  the  cases  where  they  fail  to  converge  (k  = 1 t 2 for  BISECTION,  k = 2 for 
NEWTON) . 

Finally  Table  VIII  summarizes  these  results  by  tabulating  the  efficiencies 
of  GNEWTON,  NEWTON  and  BISECTION  relative  to  that  of  ROOTFINDER  for  Mf  = 2 ^ 
which  was  chosen  because  it  converges  in  every  case). 

It  is  to  be  noted  that  these  comparisons  are  overly  favorable  to  NEWTON 
and  BISECTION  as  they  do  not  take  into  account  the  computational  effort  neces^ 
sary  to  obtain  an  initial  guess,  which  necessarily  degrades  the  efficiency. 
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Advantages  and  limitations  of  GNEWTON  and  ROOTFINDER 

The  distinguishing  feature  of  GNEWTON  and  ROOTFINDER  is  that  their 
convergence  does  not  depend  on  an^initial  guess".  This  is  especially  im- 
portant if  they  are  to  be  used  as  a subalgorithm  within  a larger  program, 
where  human  supervision  of  a trial  and  error  search  is  impractical. 

The  logic  of  ROOTFINDER  is  simple  enough  that  it  can  be  programmed  on 
a commercially  available  8 register,  49  program  step  pocket  calculator  with  19 
steps  and  4 registers  left  for  programming  the  function  whose  roots  are 
sought.  This  is  sufficient  to  solve  non-trivial  problems.  On  the  other 
hand  it  can,  In  some  cases,  be  prohibitively  inefficient.  This  shortcoming 
is  overcome  to  a degree  by  GNEWTON  at  the  cost  of  a somewhat  more  limited 
applicability  and  more  algorithm  complexity. 

A second  shortcoming  of  these  methods  is  that  it  is  difficult  to  extend 
them. to  several  dimensions.  Indeed,  in  order  to  preserve  the  feature  that 
all  roots  are  computed,  the  computational  effort  must  increase  at  least  ex- 
ponentially with  the  dimension  of  the  space.  For  the  same  reason  it  is  dif- 

ficult to  extend  MAXFINDER  to  many  dimensions. 

Conclusions 

We  have  developed  some  methods  for  computing  the  real  roots  of  a real 
Valued  function  in  a finite  interval.  These  methods  are  novel  in  that  their 

convergence  does  not  depend  on  the  customary  "initial  guess11;  one  can  conclude 

that  the  function  has  no  roots,  if  that  be  the  case;  and  if  it  is  not,  one  can 
compute  all  the  roots  of  the  function  in  the  interval.  On  the  opposite  side 
of  the  coin,  these  methods  appear  to  be  substantially  less  efficient  than  the 
notably  efficient  but  erratic  method  of  Newton.  This  penalty  in  efficiency  may 
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be  a reasonable  price  to  pay  for  the  definitive  nature  of  the  results  obtained. 
This  is  particularly  true  if  these  algorithms  are  to  be  used  as  sub algor ithras 
within  a larger  program,  in  which  case  human  supervision  of  a trial  and  error 
search  would  not  be  practical. 


References : 


R.  W.  Hamming,  Numerical  Methods  for  Scientists  and  Engineers M 
McGraw  Hill  1962 

W.  E.  Milne,  "Numerical  Calculus M,  Princeton  University  Press 
1949 

Wendell  L*  Grove,  ,?Brief  Numerical  Methods"  Prentice-Hall  Inc, 

A*  S.  Householder,  "Principles  of  Numerical  Analysis"  McGraw^ 
Hill,  1953 

Forman  S . Acton,  ,fNumerical  Methods  that  Work"  Harper  & Row 
1970 

Jon  M*  Smith,  "Scientific  Analysis  on  the  Pocket  Calculator" 

John  Wiley  1975 

C.  E*  Gray,  "Optimal  Control  of  Systems  with  combined  pulse 
width  and  pulse  frequency  modulation".  Doctoral  Dissertation 
Department  of  Electrical  Engineering  and  Computer  Sciences, 
University  of  California,  Berkeley,  California,  March  1971 

J,  F.  Traub,  "iterative  Methods  for  the  Solution  of  Equations" 
Prentice-Hall  Inc . 

C.  E,  Gray,  "Optimal  Control  of  Systems  Using  N-State  Controls" 
IEEE  Transactions  on  Automatic  Control  Vol  * AC*'  19,  No*  4 
August  1974 


113 


APPENDIX 


Proof  of  Theorem 


Suppose  first  that  f(a)  =0*  Then  from  Step  2 of  Algorithm  1: 


Suppose  now  that  f(a)>0  (to  fix  ideas).  For  any 
if  f (x^)  > 0 , then  £(x)>0  V x£  xi+1  j . 

Indeed  from  Step  2 of  Algorithm  1 : 

m*  c*t+rxi>=f  fxi)>0 

From  the  Lipschitz  condition 

(1)  f(x)  - f(xt)  > - M'  (X  - xt)  > - M (x1+rx.)  yx 


Hence ; 


f(x)  >0  ¥ x e ^xt,  xi+lj  i - 0,  1,  2 

Thus  the  sequence  j j is  strictly  monotonically  increasing, 
finite,  it  must  be  true  that  for  some  k,  > b since  only 
leads  to  a stop  command  in  the  algor ithmf  In  this  case  : 


If  it  is 
this  condition 


f(x)  >0  V x € [a,  XjJ 

Since  [a,  x^j^[a,b  j ,f  (x)  has  no  roots  in  [a,b  ] , 

Conversely,  if  the  sequence  is  infinite,  then  for  every  1 : 
xt  < b 

Hence  the  sequence  has  a least  upper  bound  x*^b*  Since  the  sequence  is 
monotonic,  it  converges  to  x*.  Therefore,  as  was  shown  above, 

f(x*)  =0 

On  the  other  hand  from  (1)  there  follows; 
f(x)  >0  V x C (a,x*) 

Hence  x*  is  the  smallest  x*  in  [a,b]  such  that  f(x*)  = 0, 
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TABLE  I 


X 

CONDITIONS 
N€  , |f(x)| 

PATH 

THROUGH  ALG0RITH1 

[ * 

0 

1 

| 1-2-6-8-9-10-1 

! . - - 

xi 

1 

>f 

| 1-2-6-8-9-10-1 

r 

*2 

: 

j 

1 

1 

< € 

\ a ' ' u 

1-2-3-4-5-9-10-1 

j X~ 
! J 

0 

< * 

1-2-3-5-9-10-1 

X4 

i 

i 

0 

> C 

j 

1-2-6-7-8-9-10-1 

REMARKS 


initialized  to  1 


computed  as  1 


Transition  of  jf(x) 
> * to  < f 


from 


^ j is  left  hand  end  of  interval 


where  f(x) 


< e , There  is 


at  least  one  such  interval,  hence 


KR  is  set  to  0. 


f(x)  remains  < e 


Transition  of  j f (x)  j from 
< f to  ^ £ 

^ is  Right  hand  end  of  interval 
where j f J < e,  Set  value  of  j 
to  j + 1 as  index  of  next  such 


interval* 


TABLE  II 


Algorithm 

k 

t - 10  '2 

€=  10  ^ 

t = 10  "6 

Rema rka 

0 

7.7 

12.3 

17 

(*)  (See  below) 

R00TFINDER(2  77  ) 

1 

56 

565 

5656 

2 

4 

4 

4 

0 

42 

72.7 

103.3 

(*)  (See  below) 

ROOTFINDER(10  77  ) 

1 

283 

2825 

28280 

2 

19 

19 

19 

0 

84 

145.3 

206.7 

(*)  (See  below) 

R00TFINDER(20;r) 

1 

566 

5650 

56560 

2 

37 

37 

37 

0 

18.7 

25.7 

33.3 

(*)  (See  below) 

GNEWT0N(2  r 7 ) 

1 

46 

92 

161 

2 

7 

7 

7 

0 

22.3 

30 

37 

(*)  (See  below) 

GNEWTON(10  TT  ) 

mm 

95  ■ 

153 

211 

15 

15 

15 

0 

1 

24.7 

33 

40  j 

(*)  3 roots  To  get 

1 

total  function  eval- 

GNEWTON (20  77) 

1 

115 

218 

325 

uations  multiply 

entry  by  3. 

2 

18 

: 18 

18 

0 

10 

i 

17 

23 

BISECTION 

1 

oc 

* ) 

BISECTION  cannot 

i 

converge 

2 

cc 

00 

oc  J 

- 

0 

5 j 

7 

7 

NEWTON 

1 

' 

7 

13 

21 

2 

1 1 

00  ! 

; oc 

00 

NEWTON  cannot 

converge 

Average  Number  of  function  evaluations  per  root  for  various  algorithms. 
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TABLE  III 


Lipschitz 

-2 

-4 

-6 

Constant 

k 

£ = 10 

€ = 10 

io 

REMARKS 

2 1 r 

0 

.41 

.48 

.51 

Optimum  Lipshcitz 

1 

1 * 22 

6.14 

35  * 13 

Constant 

2 

.57 

.57 

.57 

10  7T 

0 

1.88 

2.42 

2.79 

5 x degraded 

1 

2.98 

18.46 

134.03 

Lipschitz  Constant 

2 

1.27 

1.27 

1*27 

20  7 T 

0 

3.4 

4.4 

. 

5.17 

10  x degraded 

1 

4.92 

i 

25.92 

174.03 

Lipschitz  constant 

2 

2.06 

2.06 

2.06 

Efficiency  of  GNEWTON  relative  to  ROOTFINDER  as  a function  of  Lipschitz  constant 
(M1,  M") , accuracy  ( £ ) and  function  (f^(x)  = k + Sin  2 ttx  ), 
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Efficiency  of  ROOTFINDER  (M')  relative  to  ROOTFINDER  (2 u ) 
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TABLE  V 


n 

Lipschitz 

n 

-2 

-4 

-6 

Cons  tant 

K 

€ = 10 

€ =10 

€=10 

Remarks 

0 

1 

1 

1 

2 7 r 

1 

1 

1 

1 

Optimum  value 
of  Lipschitz 

2 

1 

1 

1 

cons  tant 

0 

.18 

.17 

.16 

10  rr 

1 

o 

CM 

.20 

o 

(N 

2 

.21 

.21 

.21 

0 

.09 

CO 

o 

o 

CD 

r 

20  7T 

1 

.10 

.10 

.10 

L- 

2 

.11 

.11 

.11 

Efficiency  of  GNEWTON  (M")  relative  to  GNEKTON  (2  7T  ) 
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TABLE  VI 


ROOTFINDER 
Compared  to 


-2 

-4 

-6 

: = 10 

€=  ib 

!l 

H-i 

o 

Remarks 

1.3 

1.38 

1.35 

Optimum  Lipschitz 
constant  used . for 
ROOTFINDER 

00 

X 

X 

Bisection  cannot 

-X 

j X 

X 

converge 

65 

.57 

.41 

Optimum  Lipschitz 
constant  used  for 
ROOTFINDER 

13 

.02 

.004 

PQ 

sc 

Newton  cannot 
converge 

BISECTION 


1NEWT0N 


1 

X 

0 

1 

2 


Efficiency  of  ROOTFINDER  relative  to  BISECTION  and  NEWTON. 


tf 
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GNEWTON 
Compared  to 

k 

e * to"2 

-4 

e = io 

« 

o 

i— i 

IE 

w 

Remarks 

0 

,54 

.56 

.69 

Optimum  Lipschitz 
constant  used  for 
GNEWTON 

BISECTION 

1 

OO 

oo 

OO 

BISECTION  cannot 
converge 

2 

oo 

00 

CO 

0 

,26 

.27 

.21 

Optimum  Lipschitz 
Constant  used  for 
GNEWTON 

NEWTON 

1 

.15 

.14 

.13 

i 

2 

! 

oo 

00 

oc 

I 

NEWTON  cannot 
converge 

Efficiency  of  GNEWTON  relative  to  BISECTION  and  NEWTON 


TABLE  VIII 


Algorithm  Compared 
to  ROOTFINDER  (2  n ) 

k=0 

k=i 

k=2 

Remarks 

ROOTFINDER  (2  7T  ) 

1 

1 

1 

ROOTFINDER’ s perfor- 
mance degrades  by  a 

ROOTFINDER  (20  rr  ) 

.09 

.10 

.11 

factor  of  10 

GNEWTON  (2  7T  ) 

.41 

1.22 

.57 

GNEWTON  (20  7T  ) 

.31 

.49 

.22 

GNEWTON 's  perfor- 
mance degrades  by  a 
factor  less  than  3 

BISECTION 

.77 

0 

0 

BISECTION  cannot 
converge  for  k > 1 

NEWTON 

■ 

1.53 

8 

0 

NEWTON  cannot 
converge  for  k = 2 

Efficiency  of  various  algorithms  relative  to  the  optimum  ROOTFINDER  algo- 
rithm for  C =10 
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Figure  1.  The  bisection  method  works 


Figure  2.  The  bisection  method  fails 

(there  do  not  exist  uq  , e [a,b]  such  that  f(uo>  ffv^)  < 0) 


Figure  3.  The  bisection  method  fails 
(there  is  no  root) 
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Figure  4.  Newton’s  method  is  very,  very  good 


Figure  5.  Newton's  method  is  horrid 
(because  of  an  inflection  point  near  the  root) 


Figure  6.  Newtons's  method  is  horrid 
(because  there  is  no  root) 
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UlGJMl 


Nf -o\ 

Figure  10.  Analysis  of  Algorithm  2 
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1 - k K 
a -+  X b *+  V 


CALL  MAXFINDEFT 

IX, V)  J 


M'(X,Y) 


/callrootfinderN 

SJ1X,  Y)  / 


STORE  e 

STACK  A * INTERVALS  JUST 

e-INTERVALS  CURRENTLY  COMPUTED 
BEING  SCANNED  IN  TOP  OF  STACK  B 

I VVK  I I 


x , v 

k ' 7k 


x # V 
2 2 


Y >"SCAN  MORE 
V.  INTERVALS? 

\ KK  / 


CT  & C*  K R 

e-INTERVALS  CURRENTLY 
BEING  COMPUTED 


[®  ^ 

k + 1 -*  k’  (5) 


Figure  16.  Flow  chart  of  Algorithm  4:GNEWT0N 
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(b) 

Figure  19.  Effect  of  M on  step  size  of  ROOTFINDER 
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AN  IMPROVED  ITERATIVE  METHOD  FOR  OPTIMIZING 
SYMMETRIC  SUCCESSIVE  OVERRELAXATION 


Vital i u s Benokraitis 

Applied  Mathematics  and  Sciences  Laboratory 
U.S.  Army  Ballistic  Research  Laboratories 
Aberdeen  Proving  Ground,  Maryland  21005 


ABSTRACT.  An  algorithm  is  proposed  to  determine  the  optimum  relaxation 
parameter  u)Q  as  well  as  the  spectral  radius  of  the  iteration  matrix 

corresponding  to  the  symmetric  successive  overrelaxation  method.  The 
algorithm  is  based  on  the  work  of  Evans  and  Forrington  and  Young.  Computa- 
tional results  indicate  that  the  improved  algorithm  converges  to  the 
optimum  parameter  even  when  the  scheme  of  Evans  and  Forrington  fails. 

1.  INTRODUCTION.  Consider  the  linear  system 
(1)  Au  = b 

where  A is  a real,  symmetric,  positive  definite  matrix  of  order  N.  The 
real  N-vector  b is  given  and  the  N-vector  u is  to  be  determined. 

Systems  of  the  form  (1)  arise  in  the  finite  difference  solution  of 
boundary  value  problems  involving  elliptic  partial  differential  equations. 

In  particular,  we  shall  be  concerned  with  the  generalized  Dirichlet  problem 
involving  the  differential  equation 

<2>  Ltu]  (A  g)  * £ (C  f)*  F„  = T 

where*  A = A(x,y)  > 0,  C = C(x,y)  > 0,  and  F = F(x,y)  <_  0 in  R u S.  Here 
R is  a bounded  connected  plane  region  and  S is  its  boundary.  Given  a 
function  g(x,y)  continuous  on  S,  we  seek  a function  u(x,y)  twice  continuously 
differentiable  in  R and  continuous  on  S such  that  L[u]  = T in  R and  u = g 
on  S. 


In  order  to  apply  the  method  of  finite  differences,  we  superimpose  over 
the  region  R a grid  consisting  of  a network  of  horizontal  and  vertical  lines 
spaced  at  intervals  of  h = Ax  = Ay  units  apart.  For  simplicity,  we  assume  the 
spacing  to  be  uniform,  although  this  is  not  a necessary  requirement. 


Now  for  some  hg  and  some  (xg,  yg)  in  R,  we  consider  the  set 


which  contains  all  points  of  the 
and  j.  Following  Young  [1974],  we 


form  (x  + ihg,  yg  + jhg)  for  integers  i 
e assume  that  for  any  point  of  nu  which 


lies  in  R,  the  four  adjacent  points  lie  in  R or  on  S.  Furthermore,  this 
property  is  assumed  to  hold  for  all  h such  that  hg/h  is  an  integer.  More- 
over, we  define  R^  = n R and  S^,  = % n S. 


*The  coefficient  A = A(x,y)  should  not  be  confused  with  the  matrix  A of  the 
system  (1). 
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At  any  point  (x,y)  of  Ru  = Jiu  n R,  the  differential  equation  given  by 
(2)  is  replaced  by  the  symmetric  difference  equation 


(3) 


Lh[u]  ■ 


+ 


4r  |a^x  + yj  [u  (x  + h,  y)  - u (x,  y)] 
A (x  - p y^  [u(x,  y)  - u (x  - h,  y)] 
c (x>  y + [u  (x,  y + h)  - u (x,  y)] 

c (Xj  y ‘ f)  [u  y)  " u (x»  y ~ h)]  | 


+ F (x,  y)  u (x,  y)  = T (x,  y). 


Thus  we  have  transformed  the  continuous  problem  to  a discrete  generalized 
Dirichlet  problem.  That  is,  we  now  seek  to  determine  a function  u defined 
on  Rh  u Sh  such  that  l_h[u]  = T(x,  y)  on  Rh  and  u = g on  Sh. 

Multiplying  (3)  by  -h2  and  bringing  the  known  boundary  values  to  the 
right-hand  side  yields  a system  of  the  form  (1)  where  the  order  of  the 
matrix  A,  assumed  to  be  N,  is  the  number  of  mesh  points  in  R^,.  Besides 
being  positive  definite  the  matrix  A can  be  shown  to  have  Property  A.  Also 
it  can  be  verified  that  the  matrix  A is  an  L-matrix,  irreducible,  and  weakly 
diagonally  dominant  (Young  [1971]). 


Another  property  of  the  matrix  A,  which  naturally  (but  not  exclusively) 
points  to  iterative  techniques  as  a mode  of  solution  for  the  system  (1)  is 
that  A is  large  (the  order  is  about  103  to  10°)  and  sparse  (i.e.,  the  number 
of  nonzero  elements  is  small  compared  to  the  total  number  of  elements  in  A). 


The  iterative  scheme  on  which  we  shall  focus  our  attention  is  the 
symmetric  successive  overrelaxation  (SSOR)  method.  In  particular,  we  shall 
be  interested  in  obtaining  certain  parameters  of  an  accelerated  SSOR  method. 

2.  THE  SSOR  METHOD.  In  order  to  define  the  SSOR  method  it  is  con- 
venient rewrite  (1) in  the  form 

u = Bu  + c 

B = I - D~T  A - L + U 
C = D_1b 

and  where  D * diag  (A)  and  L and  U are  strictly  lower  and  upper  triangular 
matrices,  respectively. 


(4) 
where 

(5) 
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The  SSOR  method  is  defined  as  follows  (Sheldon  [1955]).  Let  i/° 
be  an  arbitrary  initial  approximation  to  the  solution.pf  the  system 
Au  = b,  and  define  the  sequence  u^’,  u('),  u^',...,  by 


(6) 


U(n+W  . „(u<nt«  + Uu(n)  * c)  ♦ (1  - »)  u<n> 

u(nt1>  - „(Lu("+y  ♦ Uu<n+1>  ♦ c>  ♦ (1  - »>  u<n+)*> 

n = 0,  1,  2,... 


Eliminating  u^n+^  we  get 


(7) 


u<n+1>  = s 


u(n>  + 


where 

S = U L 

CO  (O  CO 

= (I  - coU)  ^ (coL  + (1  - co)  I)  (I  - <oL)"l  (coU  + (1 
= co(2  - to)  (I  - coU)"1  (I  - coL)"1  c 


co)l ) 


One  of  our  goals  will  be  to  determine  the  relaxation  parameter  so  that  the 
rate  of  convergence  of  our  method  is  optimized  in  some  sense. 


Let  us  look  at  the  SSOR  method  a little  closer.  Anyone  familiar  with  SOR, 
will  recognize  that  each  SSOR  iteration  is  composed  of  two  SOR  half  iterations. 
The  first  half  iteration  is  just  a "normal"  SOR  iteration.  The  second  half 
iteration  is  another  SOR  iteration  but  taking  the  equations  in  reverse  order. 


Even  though  one  SSOR  iteration  is  composed  of  two  SOR  iterations,  there 
is  a way  to  reduce  the  work  required  for  each  SSOR  iteration  by  providing 
storage  space  for  an  extra  N-vector.  How  this  is  accomplished  can  be  seen  by 
explicitly  exhibiting  two  full  SSOR  iterations  and  noting  that  certain  vectors 
are  repeated  in  a pair  of  half-iterations. 

u(n+3s)  = »(Lu(n+!s)  + Uu(n)  + c)  Ml  - «)  u(n) 

save  Li/n+3^ 

u(n+1)  = w(Lu(n+!s)  + Uu(n+1)  + c)  (1  - co)  u(n+Js) 

save  Uu(n+1) 

U(nt3/2>  = u(Lu<"«/2>  ♦ Uu(nt’>  . c)  ♦ (1  - „) 

save  Lu^n+^2^ 

u(n+2)  = <o(Lu(n+3/2)  + Uu(n+2>  + c)  + (1  - co)  u(n+3/2) 

save  Uu^n+2^ 
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This  scheme  is  due  to  Niethammer  [1964].  The  work  required  per  iteration  using 
this  technique  is  approximately  the  same  as  with  the  SOR  method. 


We  noted  that  the  SSOR  method  can  be  written  as 
u<n+l>  . s u(n)  + k 
with  and  appropriately  defined. 

Evidently,  SSOR  is  a stationary  iterative  method  of  first  degree.  It 
is  stationary  because  and  kw  are  fixed  from  iteration  to  iteration.  It  is 
of  first  degree  since  u^n+ ‘ ) depends  only  on  one  preceding  iterate  u(n). 

It  is  well  known  (Young  [1971])  that  we  must  have  S^)  <lin  order  to  guarantee 
convergence.  Here  $ (Sw)  denotes  the  spectral  radius  of  S^.  This  conditions  holds 
if  0 < w < 2 and  A is  positive  definite.  Also,  if  we  define  the  error 


(n) 

e ' = u 

-u 

where  u is  the  exact  solution  of  Au  = 
appropriate  norm) 

1 |t(n)!  1 

= s (SJ  II  e*"'1*  I 

and 

i|e(n)n 

= [S(Su)]n||e(0)|| 

To  reduce  | |e^ | | 

to  a fraction,  say 

I |e(n)! I 

= [S(s„)]n  < t 

w 

.(0), 


we  must  have 


and  we  must  iterate  n times,  where 


n 


.-log  $ . 

-log  S(SJ 


We  define  R(SW)  s -log  S (S^)  as  the  rate  of  convergence  of  the  SSOR  method. 

As  indicated,  we  suspect  that  the  rate  of  convergence  of  the  SSOR  method 
R(SJ  = -log  S (Sj 

depends  on  «.  Not  evident  from  our  notation,  the  rate  of  convergence  is  also 
governed  by  the  ordering  of  the  equations.  Given  an  ordering,  the  optimum  u 
will  be  considered  to  be  the  u which  minimizes  S(SW)  or  maximizes  R(S,.). 
Assuming  the  "natural  ordering",  for  a certain  "good"  choice  of  u = which 
depends  on  upper  bounds  of  S(B)  and  S(LU),  Young  [1974]  has  shown  that  the 
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number  of  iterations  necessary  to  solve  the  discrete  generalized  Dirichlet 
problem  is  proportional  to  h'1 . This  is  the  same  order  of  magnitude  required 
by  the  SOR  method  with  optimum  w. 

Thus,  even  if  Niethammer's  work-saving  techniques  are  employed,  there 
seems  to  be  little  justification  to  choose  SSOR  over  SOR. 

However,  what  is  nice  about  the  SSOR  method  is  that  the  eigenvalues  of 
the  iteration  matrix  are  real  and  nonnegative  (Young, [1971]).  Under  these 
conditions,  it  is  possible  to  accelerate  SSOR  by  an  order  of  magnitude  by  means 
of  semi-iteration.  (This  is  not  possible  for  the  SOR  method,  since  many  of 
the  eigenvalues  of  its  iteration  matrix  are  complex  for  the  optimal  <d.) 

3.  SSOR  SEMI-ITERATION.  Semi -iteration  was  studied  by  Varga  (1957)  and 
Golub  and  Varga  (1961).  The  optimum  semi -iterative  method  based  on  SSOR, 
denoted  by  SSOR-SI,  is  defined  by 

“(n+1)  • o„*i  < FW.  “(n)  *v*n-a  .<•>»  * 0 - p„tI) 

Here 

2 

p " 2-S(S  ) 

v ur 


p2  = (1  - a2/2)_1 

2 l 

a p -1 

Pn+]  (1  ) j n = 2,  3,  ... 

where  S(S  ) 

° = a^srlr 

to' 

We  see  that  in  order  to  apply  SSOR-SI,  we  must  estimate  the  parameter 
S(SW)  along  with  w.  Using  estimates  derived,  again,  by  Young  [1974],  the 
required  number  of  iterations  is  proportional  to  h~%,  an  order  of  magnitude 
better  than  SOR. 

4.  OPTIMAL  PARAMETERS.  The  parameters  o and  S (S^)  determined  by  Young 
are  not  optimal.  Habetler  and  Wachspress  (1961)  determined  what  they  pre- 
sumed were  the  optimum  parameters: 

2 

“o 1 — 

1 + /I  - 2a  + 4g 
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1 


1 - a 


where 

a = 


) 


and  where  visa  vector  such  that  S v = $(S  )v.  However,  these  formulas  are 

(0  (i)„ 

o o 

not  usable  directly,  since  these  are  implicit  relationships.  Thus  we  are  unable 
to  determine  a>0  and  S(S  ) from  the  formulas  since  the  formulas  imply  that  we 

“o 

know  the  eigenvector  v (and  therefore  S(S  ) and  <o0). 

“o 

Evans  and  Forrington  [1963]  used  an  iterative  technique  based  on  these 
formulas  to  determine  the  optimum  w and  corresponding  S(SW).  Starting 
with  an  initial  guess  of  w,  the  power  method  on  the  matrix  S was  used  to 
determine  an  approximation  to  the  eigenvector  associated  with  S(SW).  Then, 
via  a and  e,  <u  and  S (-S  ) were  updated  by  the  formulas,  and  the  procedure  was 
repeated  until  <d  and  S (S  ) settled  down. 

This  procedure  worked  quite  well  for  Laplace's  equation.  However,  our 
numerical  results  indicate  that  for  certain  cases  where  S(LU)  < the 
Habetler-Wachspress  formulas  do  not  hold,  and  in  turn,  the  Evans-Forrington 
procedure  fails.  It  was  found  that  the  optimum  parameters  seem  to  be 


2 


Incorporating  these  formulas  into  the  Evans-Forrington  scheme  we  have  a 
new  method  for  determining  the  optimum  parameters  w0  and  S(S  ) for  the  SSOR-SI 
method.  The  algorithm  follows  wo 

1.  Choose  convergence  tolerances  e,  and  e?  and  initial  values  of  w and 

v f 0.  1 

2.  Iterate  with  the  power  method  to  obtain  S(S  ) and  a vector  v such  that 

' <D 

S v = S(S  ) v 

a)  a) 
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3.  Compute 

a = (v,  DBv)/(v,  Ov) 

6 = (v,  DLUv)/(v,  Dv) 


4.  Compute 

\ - ~ If  a < 4 6 

1 + A - 2a  + 4g 

U)  1 * , 

? if  a > 4 8 

1 + A - 48 


S '■ 


1 - a 

/T  -~7«  “46 


+ 


1 - a \ 

A~-  27  + * / 


If  a < 46 


1 - /I  - 48 

i ♦ /Hnr? 


if  a > 48 


5.  Terminate  process  If 

| M - «0  * | 

|S(SW)  - S'  I < c2 

and  choose 


~o 

S(S  ) - S’ 
wo 

Otherwise  set  w - »'  and  go  to  Step  2. 

5.  NUMER I CAL  EXAMPLE.  We  now  apply  the  algorithm  to  a specific  discrete 
generalized  Dlrlchlet  problem  of  the  form  (3).  In  particular,  let 

A(x,y)  a C(x,y)  ■ e^°^x+y^ 

F(x,y)  - T(x,y)  - 0 


and  let  the  region  be  the  unit  square  with  zero  boundary  values  except  unity  on 
side  y * 0.  Initially,  a "good”  u (Young  [1974])  was  chosen.  The  tolerances 
ei  ae2s  IQ-4  were  specified.  Results  are  given  In  Table  1. 
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TABLE  1 


OPTIMUM  PARAMETERS 

“0  and  S(S 

0 

) OBTAINED  BY  ALGORITHM 

h-1 

wo 

o 

3 

0 o 

Number  of 
Iterations 

S(LU) 

20 

1 . 5866 

.5866 

5 

.2329 

40 

1.7653 

.7653 

15 

.2455 

80 

1 .8742 

.8742 

35 

.2489 

The  Evans-Forrington  algorithms  did  not  converge  in  this  case.  For  a more  detailed 
discussion  of  this  example  and  a number  of  other  problems,  see  Benokraitis  [1974]. 
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VP  - SPLINES,  AN  EXTENT I ON  OF  TWICE  DIFFERENTIABLE  INTERPOLATION 

Royce  W.  Soanes  Jr, 

Watervliet  Arsenal,  Watervliet,  New  York 

1,  Introduction.  Since  the  cubic  spline  is  a special  case 
of  the  Variable  Power  - splines  presented  here,  the  latter  may  be 
regarded  as  a generalization  of  the  former.  This  generalization 
does  not  take  place  in  the  direction  of  obtaining  higher  degrees 
of  differentiability,  however,  but  rather  in  the  more  practical 
direction  of  obtaining  greater  flexibility  and  better  local  be- 
havior while  retaining  two  orders  of  differentiability.  The 
cubic  spline  has  the  optimal  property  of  being  the  interpolator 
of  smallest  quadratic  mean  second  derivative;  this  is  a global 
property,  however,  and  as  such  may  conflict  with  the  curve 
fitter’s  desire  for  certain  local  behavior.  Specifically,  the 
cubic  spline  has  a tendency  to  misrepresent  the  behavior  of  a 
function  which  passes  from  a region  of  low  curvature  through  a 
region  of  high  curvature.  The  cubic  may  underestimate  the  high 
curvature  and  overestimate  the  low.  This  tendency  may  be  mani- 
fested in  the  cubic  interpolater  by  the  presence  of  inflection 
points  where  none  are  desired.  VP  - splines  make  it  possible  to 
develop  a more  aesthetically  pleasing  functional  curve  from  a 
given  set  of  data  by  eliminating  or  at  least  diminishing  undesir- 
able local  behavior  when  it  is  encountered. 


NOMENCLATURE 


X1<X2<***<XN  a sequence  of  nodes  over  an  interval  on  which 
we  desire  to  interpolate  the  function  y. 

&i  = xi+rxi  * length  of  the  ^th  subinterval. 

y^  = known  function  value  at  the  node. 

y^,yV  = unknown  first  and  second  derivatives  at  the  ^th  node. 

Xi  W = interpolating  function,  derivative  and 

second  derivative  at  any  point  in  the  ^th  subinterval. 

rj  = (x-xi)/Jli 

= cyi+i-yi)/£i 

2.  The  Basic  Form.  Consider  the  following  interpolatory 
function  defined  on  the  ^th  subinterval. 

mi  ni 

Cl)  k^Cx)  = ai+biri+c^i  +di(l-ri) 

The  parameters  and  n^  are  positive  real  numbers  and 

+ni”inxni.  To  insure  a bounded  second  derivative,  m-j_  and  n^ 

* 

should  both  be  greater  than  2.  The  functions  y^(x)  and  y^(x) 
can  be  evaluated  at  and  x^+i  to  obtain  ai,bi,ci  and  d^  in 
terms  of  y^yi+^yi  and  y^i* 

(2)  &i  » kiyi+^iCraiqi-Cnii-Oyi-yi+i) 
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(3)  bA  = £iC-miniqi+miyi+niyi+1) 

C4)  ci  = «.iCniqi-yi-Cni-l)yi+i) 

(5)  dj  = ^i(-miqi+Cmi-l)yi+yi+1) 

The  interpolater  is  only  once  differentiable  at  this  juncture, 
so  we  enforce  continuity  of  its  second  derivative  at  the  ^th  node 
by  setting  y^(x^)  = yt-iCXi)  anci  obtaining  Eq.  (6). 

(6)  [tjini.iki  Cl-ki_i)  + S-i_xniki_i  (1-ki)  ]y[ 

= (ni-iqi_i-yi-i)+ 

Once  mi  and  ni  are  set  for  each  subinterval,  we  may  solve 
the  linear  system  represented  by  Eq.  (6)  (plus  end  conditions) 
and  obtain  the  nodal  derivatives  which  insure  the  continuity  of 
the  second  derivative  of  the  interpolater.  Setting  % a = 3 
on  each  subinterval  would  give  us  the  cubic  spline  formula. 

3.  Effect  of  Large  m's  and  n's.  Some  insight  into  the 
effect  of  increasing  the  m's  and  n's  may  be  obtained  from  the 
consideration  of  a special  case.  Equation  (7)  may  be  obtained 
from  Eq.  (6)  by  setting  n>i  = nj  = m on  each  subinterval. 

(7)  (m-lHV^i-^yl  = £i(mqi-l-yi-l)+Ai-ltmqi'>ri+l^ 
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It  is  apparent  from  Eq.  (7]  that  the  corresponding  linear  system 
will  become  increasingly  more  diagonally  dominant  as  m becomes 

I 

large,  and  in  the  limit  as  m approaches  infinity,  y^^  will  be 
given  by  Eq.  (8) . 

(8)  y!  = 

At  the  same  time.  Equations  (1)  through  (5)  show  that  y^x) 
approaches  (the  linear  interpolate^  as  m approaches 

infinity*  This  example  indicates  that  we  should  obtain  a well 
conditioned  set  of  linear  equations  as  we  judiciously  increase 
the  mfs  and  n's  to  obtain  the  flattening  effect  which  will  pro- 
duce desirable  local  behavior  in  the  VP  interpolator . 

4.  Elimination  of  Improper  Inflection  Points.  When  the 
cubic  spline  is  used  for  interpolation,  the  curve  fitter  may 

H „ 

be  faced  with  the  undesirable  situation  of  having  y-j/i+^O 

i i 

while  (q^-y^) Cqi-yi+i)<0*  These  conditions  indicate  the  presence 
of  an  inflection  point  in  the  interior  of  the  ^th  subinterval 
when  q^  is  between  y^  and  y!+^.  This  is  an  aesthetically  dis- 
pleasing situation  created  by  the  average  curvature  diminishing 
property  of  the  cubic  spline.  The  cubic  is  trying  to  deny  the 
existence  of  a point  of  high  curvature  in  the  function.  In 
order  to  eliminate  the  undesirable  inflection  point,  of  which 
there  can  be  at  most  one  per  subinterval,  we  must  insure  that 
yVyV+1>p.  This  is  equivalent  to  having  c^d^O,  We  therefore 
want  to  enforce  condition  (9)  in  the  event  that  q^  is  between 
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But  since  (q^-y^)  (qi-y;[+1)<:0 , condition  (9)  can  obviously  be 
enforced  by  insuring  that  and  n^  obey  conditions  (10)  and 
(11). 

(10)  ">i>(yi+i-yi)/(qi-yi) 

(11)  ni>(y{+1-yj)/(y^+1-qi) 

Conditions  (10)  and  (11)  must  naturally  be  enforced  in  an  itera- 
tive manner  if  two  orders  of  differentiability  are  desired,  since 
and  n^  must  be  set  before  y|  and  y{+j  are  obtained, 

5.  A Local  Consideration,  A local  VP  - spline  over  the 
restricted  node  set  [x^_ j ,x^+j]  wiH  he  found  useful  in  the 
initial  setting  of  the  mTs  and  n’s.  If  we  set  = 0 = 

y’i^l  (xi^l) , we  obtain  the  following  end  conditions. 

(12)  (nrl)y!+1  = i^q.-yj 

(13)  (mi.i-Dy!.!  = 

Equations  (12)  and  (13)  can  then  be  used  to  eliminate  y^+j  and 
y|j  from  Eq.  (6)  to  yield  Eq.  (14). 

(14)  y-  = (Aimi.1qi.1^i.1niqi)/(S,ilni.1+Jli-1ni) 

Equation  (14)  can  now  be  solved  for  the  ratio  of  n^  to  mi_1. 


(15)  R.  = n./m^j  = 

If  we  now  set  y*  at  each  node  according  to  some  preferred  local 
i 

formula,  we  may  set  the  m's  and  n's  according  to  the  following 
rule* 

(16)  nu  ^ » L and  n^  = LR^  if  R^>1 

n^  = L and  = L/R^  if  R^<f 

Rule  (16)  sets  a lower  bound  of  L on  the  m's  and  n's  and  assumes 
that  R^  is  positive  and  finite.  If  Eq.  (15)  and  Rule  (16),  are 
used  to  initialize  the  m's  and  n's.  Conditions  (10)  and  (11)  can 
be  used  in  conjunction  with  Eq.  (6)  to  iteratively  eliminate  any 
unwanted  inflection  points, 

A simple  local  formula  which  has  been  found  effective  in  practice 
for  initial  setting  of  the  nodal  derivatives  is  given  by  Eq.  (17), 

(17)  y!  = Cl-w.)  qt_x  * 

where  = 1/(1  + /[(l  + q?)/(l  + ] J 

This  derivative  is  obtained  from  the  slope  of  the  line  through 
(Xf,  y/)  which  makes  equal  angles  with  the  left  and  right  chords. 

When  this  formula  is  used,  it  is  usually  unnecessary  to  increase  the 
m's  and  n's  gradually  in  order  to  enforce  conditions  (10)  and  (11), 
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6.  A Simple  Computational  Procedure.  An  abbreviated  procedure 


which  can  be  followed  in  most  cases  is: 

I .  Set  L equal  to  a value  between  2 and  3 , Values 
greater  than  3 may  flatten  the  curve  too  much 
between  the  nodes.  Values  closer  to  2 will  produce 
more  roundedness  in  the  interpolator . 

II,  Compute  the  initial  nodal  derivatives  according  to 
Eq.  (17)  for  2 < i < N - 1. 

III.  Set  n^  - L = ^ and  calculate  R^  for  2 <_  i <_  N - 1 

using  Eq.  (15). 

IV.  Compute  the  rest  of  the  m’s  and  n*s  using  Rule  (16). 

V.  Solve  the  tridiagonal  set  of  equations  represented 
by  Equations  (6),  (12)  and  (13)  for  the  final  nodal 
derivatives. 

7,  Examples.  The  following  drawings,  which  were  produced  on 
a graphic  display  CRT,  illustrate  the  more  stable  behavior  of  the 
VP-spline  as  compared  with  the  oscillatory  behavior  of  the  cubic 
spline.  Each  of  the  VP-spline  curves  was  computed  using  the 
abbreviated  procedure  previously  outlined  [not  making  use  of 
conditions  (10)  and  (11)]. 
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ITERATIVE  SOLUTION  OF  THE  TRANSONIC  POTENTIAL  EQUATION 


D.  C,  Adams  and  Gary  Vander  Roest 
U.S.  Army  Air  Mobility  R&D  Laboratory 
Ames  Directorate 
Moffett  Field,  California  94035 


ABSTRACT.  The  unsteady,  transonic  small-disturbance  equation  is  a suit- 
able model  for  the  flow  on  advancing  rotor  tips.  This  paper  discusses  the  rela- 
tive efficiency  of  two  implicit  iterative  techniques  used  in  the  solution  of 
this  nonlinear  equation;  namely,  the  point  SOR  technique  and  the  Douglas-Gunn 
ADI  technique.  While  the  two-dimensional  calculations  of  both  methods  demon- 
strate the  effect  of  varying  lift  and  incident  Mach  number  on  a high-speed  heli- 
copter rotor,  the  ADI  technique  proved  to  be  the  more  efficient  method, 

1,  INTRODUCTION.  The  advancing  blade  of  a helicopter  in  high-speed  for- 
ward flight  often  enters  the  transonic  flow  regime.  This  type  of  flow  is 
marked  by  the  presence  of  local  pockets  of  supersonic  flow  which  are  usually 
terminated  by  a shock.  A feature  of  transonic  flow  which  is  beginning  to  be 
considered  is  that  these  flows  are  intrinsically  unsteady.  This  is  especially 
important  for  the  helicopter  in  forward  flight,  since  the  rotor  sees  a con- 
stantly varying  incident  Mach  number  and  angle  of  attack. 

The  first  treatment  of  this  problem  was  the  paper  of  Caradonna  and  Isom 
(ref.  1)  where  nonlifting  three-dimensional  flows  were  considered.  It  was 
shown  that  the  Mach  number  variation  alone  is  a considerable  source  of  unsteadi- 
ness. ^his  treatment  used  a point  SOR  scheme  to  solve  the  set  of  nonlinear 
difference  equations  derived  from  a mixed  differencing  of  the  small-disturbance 
potential  equation.  However,  SOR  is  often  not  the  fastest  technique  at  our 
disposal.  In  this  paper,  an  iterative  ADI  scheme  to  solve  the  nonlinear  system 
of  difference  equations  is  devised  and  then  compared  with  the  SOR  method. 

2.  STATEMENT  OF  PROBLEM.  The  second-order  nonlinear  partial  differential 

equation  to  be  solved  is 


where 


A0tt  + 


B<J> 


xt 


=*  C<|>  + 

XX 


yy 


A 

B 

c 


m|/(AR262/3  ) 

[2M 2/ (AR<52/3)]  (1  + y sin  i|>) 

rl  - M2(l  + y sin  ij;)2 

— — ' — ' ' (1  + Y)M|(1  + v sin  ij>)<|>x 


<j>  = disturbance  potential 
t = time 

x = chordwise  coordinate 


a) 
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y - normal  coordinate 
X = x/c 
y = (y/c)6l/3 

and  where 

AR-  aspect  ratio  = r/c. 
c = blade  chord 

= blade  incident  Mach  number  due  to  rotation 
r = blade  radius 
x,y  = physical  coordinates 
y = specific  heat  ratio 
6 = thickness  ratio 

y - ratio  of  forward  flight  speed  to  rotational  speed,  V^/oor 
ip  = blade  azimuth  angle  (measured  from  the  downwind  direction) 

The  linearized  body  boundary  conditions  are  given  by 

♦y(x’0±)  ■ to  (2) 

where  fu  £(x,y),  the  equation  for  the  upper  and  lower  airfoil  surface,  is 
given  by 

fu,i  -i[Tu,i<*>  * f] 

where  Tu  ^ is  the  thickness  distribution  and  a is  the  blade  angle  of 
attack. 


The  far-field  boundary  conditions  are  given  by 


<Kx,y) 


x 


00 


(4) 


where 


lift  per  unit  length 
[ (l/2)pv2]c 


and  where 

v = total  velocity  of  the  rotor 
p = air  density 
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The  right “hand  side  of  eq.  (1)  is  of  mixed  type,  being  elliptic  when  flow  is 
subsonic  (ML  < 1)  and  hyperbolic  when  flow  is  supersonic  (Hl  > 1). 


Equation  (1)  is  differenced  as  follows: 

N+l  N+l  N+l  N+i  N+i  N+l  N+l 

+ BS+h  - c.CM  S T1  + (1  - e . )C«  J + 4 *M+1  (5) 

tt  xtr  XI  xx ' x-1  i-l  xx ' yyT  7 


where 


N = time  level 
M = iteration  level 
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e = 1,  e_._1  = 1 for  ML  < 1 

e.  = 0,  e.  = 0 for  MT  > 1 

i i-l  L 

e.  = 0,  e.  - 1 for  Mt  = 1 

l i-l  L 

e,  = 1,  e.  , = 0 for  shock  point 

li-l 


The  epsilon  notation  denotes  that  backward  and  central  differences  in  the 
x-direction  are  used  in  supersonic  and  subsonic  regions,  respectively.  The  use 
of  mixed  differences  is  required  for  stability.  Also,  this  notation  indicates 
that  the  sum  of  the  central  and  backward  operators  are  used  at  a shock  point. 
This  is  necessary  in  order  to  satisfy  the  equations  in  the  global  sense  and 
insure  proper  shock  jump  conditions. 
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(6) 


The  differencing  of  <p  is  modified  on  the  body  as  follows : 


2 

d,  - (<b.  - d> . - Ayd)  ) 

yy  Ay2  x>2  1>1  y 


For  the  lifting  problem,  the  potential  must  be  discontinuous  in  the  wake 
of  the  rotor.  The  difference  in  potential  at  this  discontinuity  is  equal  to  the 
circulation,  T.  In  order  to  difference  across  this  discontinuity,  <f>  dif- 
ferencing is  modified  as  follows: 


+ 


h.2-2h.i  + (h.-itr) 

Sy2 


where 

r,^_ 

262/2  ' 

Differencing,  as  above,  yields  a set  of  nonlinear  algebraic  equations  which  must 
be  solved  by  iteration.  In  the  course  of  iteration,  one  can  specify  either  CL 
and  iterate  to  find  the  angle  of  attack  (a) , or  specify  a and  iterate  to 
find  C . In  the  former  case,  a must  be  updated  until  the  jump  in  potential 
at  the  trailing  edge  equals  T . This  is  done  using 


aN+1  = aN  - l^T(xTe,0)  - 4>B (xTe » 0)  _ r] 

where  X is  a relaxation  factor  (usually  equal  to  one) . 

3.  SOR  METHOD  OF  SOLUTION.  For  the  SOR  scheme,  the  potential  of  each 
iteration  is  updated  using 


(7) 
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where  U 


M 


is  the  residual  that  is  defined  by  eq.  (5),  with  all  terms  grouped 


WllCIC  -j  — — — — — — “ J 1 T 

on  the  right-hand  side.  D^j  is  the  sum  of  the  diagonal  elements  of  the 
residual: 
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(10) 


The  relaxation  factor  m is  less  than  1 in  supersonic  regions  and  between  1 
and  2 for  subsonic  regions. 


In  order  to  start  the  problem,  the  first  two  time  steps  are  assumed  to  be 
quasi-steady  (A  and  B are  set  equal  to  zero) . This  procedure  causes  no  problems 
as  long  as  the  starting  Mach  number  is  subcritical. 
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4,  DOUGLAS- GUNN  ADI  METHOD  OF  SOLUTION.  With  this  method,  it  has  been 
found  necessary  to  include  an  artificial  $ -like  term  in  eq.  (5)  for  stability 
purposes.  We  now  have  the  modified  difference  equation 
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We  have  not  determined  the  optimal  value  of  F.  However,  F = 40B  seems  to  work 
well  for  most  uses.  For  convenience,  eq.  (11)  is  rewritten  in  an  operator  nota- 
tion as 
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where 
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M - iteration  index 
N = time  index 

S ] = spatial  operator  in  the  x-direction  = 
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The  Douglas-Gunn  scheme  (ref.  2)  is  employed  to  generate  the  ADI  routine  because 
it  is  generalizable  to  any  number  of  dimensions.  Note  that  the  term 
(Kl  + no  + S1  + S2)  contains  all  the  unknowns.  The  original  Douglas-Gunn  scheme 
does  not  include  the  6X  operator  because  they  did  not  consider  equations  that 
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had  mixed  spatial-time  derivatives.  The  general  Douglas-Gunn  recursion 
tion  for  generating  ADI  schemes  for  any  number  of  dimensions  is 
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(13) 


when;  q i.s  the  number  of  dimensions,  which  is  2 for  our  case,  and  ip*  is 
some  extrapolation  from  previous  time  steps.  Generally,  we  have  let  <|>*  equal 
the  previous  iterant.  The  first  step  of  the  two-dimensional  case  is  as  follows: 


1 

(KI  + n<5  + S})  <)>  + S2<|>*  + 2 To<t)N  = g +1  (14) 

x £=0  1 


The  coefficient  of  <p,  having  both  central  and  backward  differences,  gives  rise 
to  either  a central  or  lower  tridiagonal  matrix.  This  matrix  is  solved  by  a 
combination  of  the  Thomas  algorithm  and  direct  elimination.  Substituting  in 
eq.  (13)  for  the  second  step  (q  = 2),  we  get 
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Subtracting  eq.  (14)  from  eq.  (15),  yields  a simplified  step  2,  which  is  as 
follows : 

N+l 

(KI  + q6  + S2)4>Mfl  - (KI  + qS  )<f>  - S2<J>*  = 0 (16) 

X X 

KI  and  S2  generate  a tridiagonal  matrix.  However,  n6  adds  a term  which  is 
on  a previous  column.  This  column  must  be  solved  before  we  can  move  to  the 
next  column.  This  imposes  a necessary  order  on  the  process  and  the  tridiagonal 
inversions  march  across  the  grid  from  left  to  right.  To  get  started,  we  let 
the  first  two  time  steps  be  quasi-steady  by  setting  the  time  index  (n)  equal  to 
the  iteration  index  (m) . 


5.  RESULTS.  Numerous  cases  were  run  by  both  the  SOR  and  ADI  methods, 
using  various  Mach  number  and  lift  variations.  The  greatest  difference  between 
these  calculations  and  the  results  of  ref.  1 is  the  inclusion  of  lift.  It  was 
noted  in  ref.  1 that  upstream  wave  propagation  necessitated  placing  the 
upstream  boundary  at  least  10  chords  upstream.  With  lift,  however,  this  situ- 
ation becomes  much  more  acute,  and  it  was  necessary  to  place  the  boundaries 
20  chords  away  (upstream  and  downstream)  in  order  to  obtain  stable  solutions. 
The  ADI  method  was  generally  more  reliable,  as  the  SOR  method  often  failed  to 
converge  at  higher  values  of  lift  and  Mach  number.  However,  when  the  SOR 
method  did  converge,  it  agreed  almost  exactly  with  the  ADI  method.  In  these 
situations,  however,  the  ADI  method  appeared  to  be  about  twice  as  fast  as  the 
SOR. 

A case,  from  which  identical  results  were  obtained  from  both  methods,  is 
shown  in  fig.  1,  In  this  case,  we  specify  a sinusoidal  and  find  the 

angle  of  attack.  The  rotor  in  the  center  of  fig,  1 indicates  the  various 
azimuth  angles  for  the  Cp  plots. 
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Unsteadiness  is  indicated  by  the  effect  of  flow  history  on  the  flow. 

For  example,  at  the  rotor  positions  A and  I,  the  Mach  number,  angle  of  attack, 
and  lift  are  the  same.  Also,  the  solutions  obtained  are  so  identical  that  they 
are  plotted  on  top  of  each  other.  Clearly,  the  flow  history  plays  no  role  at 
these  points  and  the  flows  are  essentially  steady.  At  blade  position  D,  the 
lift  is  specified  to  be  zero,  and  the  solution  obtained  has  identical  flows  on 
the  top  and  bottom  surfaces.  This  is  what  one  expects  from  steady  flows.  How- 
ever, one  would  also  expect  a zero  angle  of  attack  from  a steady  solution 
instead  of  the  slightly  negative  angle  that  was  actually  obtained.  The  nega- 
tive angle  implies  that  the  flow,  up  to  this  point,  is  unsteady  but  only 
slightly  so. 

In  the  subsequent  blade  locations,  the  flow  Is  decelerating  and  the  lift 
is  again  increasing.  The  shock  is  seen  to  move  forward  but  does  not  'diminish 
much  in  strength.  In  fact,  on  the  bottom  surface,  the  shock  does  not  go  to 
zero  strength  at  all.  Instead,  it  appears  at  position  G to  be  on  the  verge 
of  popping  off  the  leading  edge  and  proceeding  out  into  space.  This  effect 
was  first  noted  in  ref.  1 for  nonlifting  cases  and  is  seen  with  great  clarity 
in  ref.  3,  which  uses  an  ADI  technique  very  similar  to  this  one.  On  the  top 
surface,  the  increasing  lift  allows  for  the  sustenance  of  the  supersonic 
region,  and  there  is  no  shock  popping.  However,  the  shocks  are  substantially 
stronger  than  at  their  corresponding  azimuths  in  the  accelerating  flow  region. 
For  example,  position  H is  the  mirror  image  of  position  B,  but  the  Cp  dis- 
tributions for  the  upper  surface  are  quite  different. 

Clearly,  the  effect  of  unsteadiness  is  large  in  a decelerating  transonic 
flow.  In  addition,  where  there  is  lift,  the  difference  in  unsteadiness  can 
be  quite  different  on  the  top  and  bottom  surfaces. 

6 . CONCLUDING  REMARKS . The  modified  Douglas-Gunn  ADI  is  about  twice  as 
fast  as  SOR  and  generally  more  reliable.  Present  computing  times  are  about 
20  minutes  on  a CDC  7600  for  a two-dimensibnal  lifting  problem.  This  can  be 
greatly  improved  with  more  efficient  programming,  a less  rigid  convergence  cri- 
terion, and  the  use  of  a variable  time  step.  An  interesting  approach  is  taken 
in  ref.  3,  where  the  nonlifting  two-dimensional  problem  is  treated,  using  a 
Douglas-Gunn  scheme.  In  this  approach,  the  difference  equation  is  linearized 
in  time,  thus  eliminating  the  iterations,  but  requiring  a smaller  time  step. 

It  appears  that  with  a combination  of  the  above  approaches,  unsteady  three- 
dimensional  calculations  would  be  a practical  proposition.  We  are  now  working 
toward  this  end. 

It  appears  that  the  inclusion  of  unsteadiness  in  transonic  flow  predic- 
tion is  at  least  as  important  (and  likely  more  important)  than  the  effect  of 
three  dimensionality.  Decelerating  flows  are  altogether  different  from  their 
quasi-steady  counterparts.  Also,  lift  raises  the  possibility  of  very  unusual 
loads  in  the  second  rotor  quadrant. 
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UTILIZING  REAL-TIME  TEST  DATA  ANALYSIS  IN 
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ABSTRACT.  For  the  proper  utilization  of  equipment  needed  for  test 
data  on-line  acquisition,  display,  and  comparative  analysis,  some 
novel  data  management  techniques  are  in  the  process  of  development. 

For  on-line  analysis  of  test  display  data,  the  recognition  of 
changing  values  in  the  inherent  uncontrolled  design  parameters  and 
system  output  variables  must  be  basic.  Also,  the  display  technique 
must  accommodate  a reasonable  number  of  sets  of  operating  and  envi- 
ronmental conditions , 

We  must  be  able  to  identify  shifts  in  the  values  of  the  parameters 
and  variables  as  well  as  shifts  in  the  weighted  contributions  of  each 
parameter  to  the  output  system  variable  value.  Each  contribution  is  the 
product  of  the  quantified  parameter  value  under  a given  condition  set 
and  the  corresponding  influence  coefficient.  This  coefficient  reflects 
the  sensitivity  of  the  output  variable  to  a nominal  change  in  the  pa- 
rameter when  measured  at  the  given  condition  set.  In  addition,  we  must 
identify  any  changes  in  the  gains  of  the  functional  equipment  components 
and  variations  in  the  transmission  characteristics  of  the  test  data  trans- 
port and  management  system.  An  observation  matrix  of  the  coefficients, 
parameter  values,  or  contributions  provides  the  display  vehicle. 

The  study  adopts  a model  consisting  of  seven  parameters  and  five 
sets  of  conditions.  Variations  of  all  quantities  are  demonstrated  and 
companion  computations  made  to  show  the  significance  of  the  variations 
and  their  identification. 
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I.  INTRODUCTION. 

One  of  the  significant  developments  in  process  to  be  applied  to 
the  test  analysis  and  evaluation  of  Army  materiel  systems  is  the  real 
time  use  of  the  computer  for  the: 

1.  On-line  acquisition  and  analysis  of  data. 

2.  Display  of  the  test  data  while  the  system  is  under  test. 

3.  Making  comparisons  of  this  test  data  with  stored  data  from 
a developed  reservoir  of  historical  data  about  similar  and  current 
systems.  Knowledge  of  the  testing  subsystem  equipment  should  be 
available  for  ready  reference  (see  Figure  1) . 

One  such  testing  system  is  in  the  design  and  planning  stage  and  called 
project  ADAPT  (Automatic  Data  Acquisition  and  Processing  Technology). 

This  program  is  under  the  direction  of  the  Materiel  Testing  Directorate 
of  the  Aberdeen  Proving  Ground. 

A practical  technique  for  on-the-spot  analysis  of  test  display  data 
is  needed.  For  such  on-line  analysis,  not  only  central  or  "expected" 
values  for  the  parameters  must  be  displayed  at  the  centralized  control 
and  display  area.  We  must  identify  shifts  in  the  values  of  the  uncon- 
trolled parameters  of  the  system  under  test  and  the  testing  and  instru- 
mentation system.  In  addition,  we  must  identify  any  changes  in  the  gains 
of  the  functional  components  of  equipment  and  the  variations  in  the  trans- 
mission characteristics  of  the  data  transport  and  management  system. 

To  assist  in  the  understanding  of  utility  and  potential  application 
for  such  a system,  a study  is  described  where  the  data  display  is  eval- 
uated as  a means  for  both  real  time  system  monitoring  and  confidence 
status  or  checkout.  A display  model  for  the  test  item  and  the  testing 
equipment  will  be  applied  to  a planned  group  of  activities.  These 
activities  are  defined  as  the  set  of  subtests  planned  and  executed  as 
a part  of  the  Test  Plan  and  defined  by  a Test  Directive.  One  approach 
to  this  model  is  the  adoption  of  a Data  Observation  Matrix  as  the  display 
vehicle.  This  model  has  found  use  wherever  one  applied  techniques  such 
as  systems  modeling  by  functions  (see  Reference  1)  on  parameter-sensitivity 
coefficients  (see  Reference  2).  Figure  2 describes  a general  form  for  such 
an  observation  matrix. 
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It  will  be  our  purpose  to  illustrate  the  application  of  analysis 
techniques  for  a matrix  model  of  seven  parameters  and  five  condition 
sets  of  grouped  activities.  The  complete  study  of  this  model  involves 
the  display  for  the  time  variations  in  the  participating  parameters  and 
their  dimensionalized  contributions  and  changes  in  the  parametric 
sensitivities , 

II,  DISCUSSION. 

A,  Mathematical  Background, 

Utility  of  Observation  Matrix  for  Data  Display: 

One  evaluation  criterion  for  usefulness  for  any  method  of 
information  identification  and  cataloging  is  that  the  variationals  of  all 
system  parameters  be  recognizable # definable,  and  predictable.  We 
have  suggested  that  the  mathematical  and  informational  system  model  for 
a given  influence  level  be  described  by  the  coefficients  array  for  the  ap- 
plicable engineering  equipment  and  physical  constraints. 

The  influence  level  is  defined  by  the  operating  subsystem  or 
data  system  gross  functions,  K or  mission  phase,  the  data  functional  model 
subfunction  or  component,  activity  and  agency,  etc.  The  (pj)  are  the 
measured  values  for  the  Xj  parameters  or  the  set  of  values  found  as  the  set 
to  maximize  or  minimize  the  criterion  - our  measure  of  adequacy/excellence. 

Two  cases  are  of  immediate  interest.  For  a change  in  system 
time  of  magnitude  (At): 

(1)  A shift  occurs  in  the  system  parameter  measurable  values 
(polarity  included) , 

(2)  The  shift  occurs  in  the  influence  coefficients,  which 
represent  either  the  compatibility  potentials  or  sensitivity  potentials  of 
the  equipment.  Transfer  or  data  transmittal  characteristics  are  the  inter- 
dependency factors  for  the  variables  and  the  system  conditions. 

In  both  cases,  the  display  integrated  parameters  and  contributions  to  the 
system  objective  functions  are  changed  in  magnitude.  A dependable  tech- 
nique must  be  established  for  determining  whether  the  shift  is  made  up 
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of  type  (1) , (2) , or  some  combination  of  these.  Variation  with  system 
condition/configuration  (J)  should  be  identifiable. 

A numerical  example  with  seven  parameters  and  five  conditions 
or  configurations  which  define  the  interdependence  relationships  will  be 
used  to  illustrate  both  cases  (1)  and  (2) . The  example  will  also  show  a 
variation  with  (J)  for  one  of  the  seven  parameters.  We  will  study  these 
for  a system  time,  t = tj , and  a time  change  A tj , a second  time  t2  and 
time  change  A t2-  The  data  are  available  for  a consideration  of  the 
variations  consistent  with  the  time  interval  (t2  - tj) , as  well  as  those 
mentioned . 


For  Case  (1) , where  the  shift  occurs  in  the  parameter  values , 
the  supporting  analysis  follows: 

The  change  in  objective  criterion  equals  — 


E*  - z = Bj  (xj  - xj)  + . . . + 8n  (xn  - xn) 


(.01) 


The  corresponding  change  in  subsystem  capacity  potential  or  integrated 
parameter  is 


wKi  " wKi  * alKi  <X1  ~ xi)  + ...  anKi  (xn  - xn) 


(.02) 


E*  - 2:  = E ^w*i  - wKi) 


(.03) 


m n 


2 E a1Kj  (Xj  - Xj) 

i J 


E 8 j (xj  - Xj)  (.04) 

J 


m n 


_ ^ (aiKJ^  ( 6 Pj)  - E (8j)  ( 6 pj)  (.05) 
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(Note:  The  starred  quantities  (previous  page)  are  values  for  variables 
at  (t^  + At)  and  unstarred  those  at  ti .) 


A check  test  is  performed  in  which  all  the  stimuli  parameters 
Xj  are  set  at  a fixed  constant  value  (normalized, this  is  equivalent  to 
unity,  1) . If  this  will  give  a zero  change  in  the  objective  function 
£ = £*  the  valid  conclusion  is  that  there  have  been  no  changes  in  the 

influence  coefficients  ^a^j^  , Superscript  (1)  identifies  check  values. 

If  both  ( ]T  * - Ya ) and  ^JK  “ SJK)  have  a value  which  is 

J I J 

non-zero  for  the  test  of  fixed  constant  values  for  the  parameters,  then 
the  variations  are  in  the  (a^j) . Mathematically,  for  the  change  in  the 

coefficients,  we  state  that: 


(i)  U)  tt> 


r-2  -i  si 

J J J i 

-f( 


*JK  aiJK 

O 

AV 

(.06) 

? J ) < 0 ; 

xj  = p!  = pj  = 1 

(.07) 

For  a shift  in  the  parameter  values,  we  state  that; 
[x.-Xj]  = (aXj) 


- KjiSj)  I ( »A  ) 

l(  VO/  ( v1  ) 

K ' 'K  'K 

At 


(•08) 


for  Xj  Xj  = Pj  = 1 

The  time  sensitivity  potential  for  the  parameter  Xj  is  found 
by  dividing  the  expression  and  value  for  A Xj  for  ihe  time  shift  At 
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by  the  time  interval  A t.  This  sensitivity  is  identical  to  the  time  rate 
of  change  of  the  variable  Xj. 


At  change 
K 


(.09) 

t,K 

V1  / 

\ 

At  ) / 

6*b51)x!  - 6J1>BA  f(,10> 

1 K 

= teJ  (1+  -s^-^  (-H) 

j 

When  the  change  is  in  the  influence  coefficients  and  not  in 
the  Xj  parameters , we  must  answer  the  selection  question  regarding 
equipment  deterioration.  "Do  we  need  to  determine  the  individual 
change  A(ajjK)  °r  t'ie  resultant  effect?"  Let  us  consider  the  mathe- 
matical forms  needed  to  answer  this  question. 


■(? 


A Ml 


L - xi<ibj> 


AwKi  * A 


| 2 aiJRXj 


(AaljK)Xj  (.13) 
for  AXj  = 0 


*Mr'H 

For  the  particular  case  where  Xj  = 1 

2*  ■ Z ) " 0 for  iaiJK  * ° 


lf  “lJK  * ° 
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The  individual  ( Aa^)  are  needed  in  order  to  determine  or 
verify  the  change  in  the  subsystem  output  variable  to  be  derivable  from 
the  summed  weighted  influence  of  the  parameters  (see  Equation  13), 
Fortunately,  our  matrix  technique  displays  all  these  quantities  and  they 
are  recognizable. 


For  our  Case  (2) , where  the  shift  is  in  the  coefficients  and 
not  the  parameters,  we  know  these  facts: 


r-z 


-Z(  l.-v  ) 

i 

= E XJ  S ( aiJK  “ aiJK  ) 
J L 


2 ( A ai]K^ 

i 


(.17) 


(.18) 


= Z ej-Sj) 

J V K 

= Z X,  (4Bj)k  . 

J 

Since  our  variations  with  At  are  in  a^,  we 
then  we  find: 


(.19) 


set  Xj  equal  to  plus  one, 


Li) 

r-z  = 


value  = 
K 


2 (A0j)  > 0 (.20) 

J 


J i 


(Aa.JK)  . 


(.21) 
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A priority  ranking  can  be  found  from  this  ratio: 


(AB(J+l)  JXj  = 1 


B.  Illustration  of  Technique. 


(.22) 


We  will  study  a system  whose  model  consists  of  seven 
parameters  subjected  to  five  constraints.  These  constraints  describe 
the  system  parameter  interrelationships  under  five  sets  of  conditions . 


Our  five  system  conditions  or  constraint  inequalities  form 
the  rows  for  our  Influence  Matrix.  Seven  dominant  parameters,  which 
are  interdependent,  form  the  headings  for  the  columns  in  the  array. 

Nine  tables  are  attached  which  illustrate  the  application  of  our  technique 
for  identification  of  the  changes  in  status  for  the  operating  equipment. 

Let  us  consider  one  system  situation  which  could  be  represented  by  the 
coefficient  array  of  Figure  2 . The  integrated  subsystems  may  be  described 
by  the  seven  dominant  and  critical  parameters  whose  measured  values  are 
available  and  accessible  as  the  outputs  of  the  several  position  references, 
inputs  to  the  circuits  for  the  power  controller,  and  inputs  to  the  path  con- 
trol circuits.  The  five  system  conditions  might  define  the  several  constraints 
for  a land  vehicle  during  the  conduct  of  a particular  set  of  maneuvers . 


Table  1 has  cell  values  of  a^  at  the  system  operating  time  tj. 
These  coefficients  are  the  interdependence  transfer  admittances  or  com- 
patibility potentials . The  CKi  are  the  integrated  display  parameters  for 
the  several  system  conditions  and  represent  the  limitation  on  the  seven 
term  sum  of  the  product  of  the  interdependence  coefficients  and  the  appro- 
priate stimuli  or  dominant  parameters  (for  the  specialized  case  of  unity 
value  for  each  of  these  dominant  parameters) . The  bj  are  the  corresponding 
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SENSITIVITY  - INFLUENCE  TABLE  FOR  SHIFT  IN  PARAMETER  VALUES  WITH  (At) 


u 

Xl 

X2 

X3 

X4 

X5 

x6 

x7 

cKi  <pJ  - 

1) 

Ai 

863 

450 

130 

445 

180 

550 

190 

2808 

A2 

804 

450 

130 

445 

180 

95 

190 

2124 

a3 

960 

675 

155 

525 

180 

120 

200 

2623 

A4 

450 

450 

104 

195 

72 

50 

100 

1385 

A5 

720 

450 

86 

455 

151 

100 

180 

, 1925 

h? 

i 

bj  st 

3797 

2475 

605 

2075 

771 

915 

940 

Aj  bj 

|<Pj  = 1) 

k- 11574 

pj  (^) 

1.903 

0.641 

1.000 

0.738 

0.406 

0.861 

0.515 

\ 

i 

Pj  <t 2 +At2) 

2.135 

0.898 

0.593 

0.896 

0.056 

0.166 

0.095 

A Pj  (At2) 

0.230 

0.  257 

0.407 

0.158 

0.350 

-0.695 

-0.420 

Apj[it2  + 

It  2 - tj)] 

0.128 

0.311 

0. 385 

0.014 

-0.  365 

- 0.682 

+ 0.428 

pj  <ti) 

2.007 

0. 587 

0.208 

0, 882 

0.421 

0.848 

0,523 

bj  Pj  at  tx 

7592 

1452 

126 

1829 

322 

776 

491 

\ = 12588 

\T  ) 

K at  t2 

TABLE  1 


capacity  potentials  for  the  unity  measured  parameter  values . This  table 
gives  the  parameter  values  for  three  system  times — (tj),  (t2) , and 
(t2  + A . 


Table  2 gives  the  array  of  cell  values  at  system  time  (12)  and 
each  is  the  contribution  at  the  corresponding  i_  system  condition  to  the 
display  parameter.  In  addition,  an  optimization  study  was  performed 
for  the  given  array  to  find  the  set  of  weighting  coefficients  or  optimal 
set  (pj*)  for  a maximization-minimization  of  the  criterion.  This  set  has 
significance  for  an  overall  performance  evaluation  of  the  operating 
subsystem  rather  than  a status  or  diagnosis  study. 

Table  3 provides  the  array  of  display  contributions  at  time  tj. 
These  values  present  rather  obvious  changes  in  cell  values  from  those 
shown  in  Table  2. 

Table  4 portrays  the  contributions  at  system  time  (t2  + At2) 
in  contrast  with  those  identified  as  a part  of  Table  2 at  (t2). 

An  optimization  study  was  also  performed  at  this  system  time 
to  find  the  set  (pj)*. 

Table  5 gives  the  variations  in  the  display  contributions  for  the 
time  change  of  ( At}).  This  array  is  used  to  illustrate  our  Case  (1)  where 
the  shift  occurs  in  the  parameter  measured  values  with  no  changes  in 
(aijK) . The  change  in  the  integrated  display  parameter  w^  is  given  for 
each_i  system  condition.  The  percentage  change  in  the  parameter  value 
measured  at  tj  is  also  determined.  This  set  of  values  shows  the  sig- 
nificant dominance  of  Xj  for  J = 1 , 2 , and  4 . 

Table  6 gives  the  variations  in  the  display  contributions  for 
the  time  change  of  ( A t2).  The  pj  corresponding  to  the  Xj  are  considered 
fixed.  The  change  in  A is  a positive  quantity  282.  However, 

the  summation  Z (A0  ) is  the  negative  quantity  (-561). 

J 1 

Table  7 is  a companion  array  to  Table  6,  but  giving  the  changes 
in  atjK  rather  than  5 [a^fpj^the  variation^  (pj)  5 [aiJK  3 ■ The  sum- 

mation of  the  variations  in  a^  first  with  respect  to  i,  and  fhen  J is  a 
negative  quantity  (-540).  This  array  is  numerically  identical  with  the  test 
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TABLE  OF  CONTRIBUTIONS  (aiJK  Pj) 

AT  t = t2 

V* 

*1 

*2 

*3 

X4 

X5 

*6 

x7 

wKi 

fKi 

A| 

1644 

289 

130 

328 

73 

473 

98 

3035 

3322 

a2 

1531 

289 

130 

336 

73 

82 

98 

2539 

2832 

A3 

1826 

433 

155 

387 

77 

103 

103 

3084 

3418 

a4 

857 

287 

104 

144 

31 

43 

52 

1518 

1666 

a5 

1371 

289 

86 

336 

61 

86 

93 

2322 

2553 

7231 

1589 

605 

1531 

316 

787 

444 

2.  eJ  = 12503,  125 00=2*^ 

1 

(Pj)* 

1.346 

0.650 

1.312 

0.738 

0.612 

0.880 

0.490 

<6J  Pj)* 

9732 

1032 

794 

1129 

193 

692 

218 

j 

L fKl  ■ 12790  6,  fr 

r 

Individual  contribution  to  Is  ^ ai]K  j 

PJ 

(t2) 

‘ j' 

L 

to 

'Ki  “ 

l 

if  2) 

jpj* 

( Pj)  are  optimum  values  of  parameters  for  the  matrix  array  having  individual  cell 


designations  axJK  (f2)  • The  result  from  an  interative , convergent 
process  maximizing-minimizing  the  criterion^ g | 


TABLE  2 


TN1 


VALUES  OF  ( aijK  Pj)** 


AT  (t  = tx) 


T-+ 

*1 

x2 

*3 

X4 

*5 

*6 

><7 

<wKi>k 

A] 

1726 

264 

27 

392 

76 

466 

99 

3050 

A2 

1606 

264 

27 

401 

76 

81 

99 

2554 

a3 

1920 

396 

32 

463 

76 

102 

105 

3094 

a4 

900 

264 

22 

172 

30 

42 

94 

1524 

A 5 

1440 

264 

18 

401 

64 

85 

94 

2366 

K III 

7592 

1452 

126 

1829 

322 

77 6 

491 

5 

(aiTK  Pj)**  at  t = tj  = K III 

1 


12588 


TABLE  3 


TABLE  OF  CONTRIBUTIONS  AT  t_  - t2  + A*2 
Values  at  t ■ t2  + At2 


I-*’ 

i 

2 

3 

4 

5 

6 

7 

% 

Fki 

* A, 

1842 

405 

77 

398 

10 

91 

17 

2840 

2911 

A2 

1716 

405 

77 

408 

10 

16 

17 

2649 

2720 

a3 

2048 

607 

92 

470 

10 

20 

18 

3265 

3349 

a4 

960 

405 

62 

175 

4 

8 

9 

1620 

1672 

a5 

1546 

405 

51 

408 

8 

17 

16 

2441 

2481 

'j* 

8102 

2227 

359 

1859 

42 

152 

84 

Xj<«V 

12815  =£f 

"Kg, 

6j*pj*** 

9641 

2062 

217 

903 

4 

93 

121 

Xl  FkC- 13027.  X6* 

*** 

PI 

PJ*** 

1.1899 

0.926 

0.605 

0.486 

0.0846 

0.613 

1.437 

Individual  contributor 

•*»  WKi 

'*  { aiJK 

)k  Pl***(t2+At 

2) 

to  FKi 

's  |mJK 

Pj  U2 

+ At  2 ) | 

*** 

pT 

\p>  J* 

the  parameter  value  set  result! 

ing  From  optimizing 

^ pKi  « 
i 

X1  / * ***) 

An  pj 
T • i 

k 

TABLE  4 


VARIATION  IN  DISPLAY  PARAMETERS 


Element  is 

5 (aiJK  PI  ) CS  aiJK  (APJ  ) 

for  AH 

\ J 

xi 

X2 

x3 

x4 

x5 

X6 

X7 

<Awk 

A| 

116 

141 

50 

6 

-66 

-375 

-82 

-210 

A2 

no 

141 

50 

7 

-66 

-65 

-82 

+ 95 

a3 

128 

211 

60 

7 

-66 

-82 

-87 

+171 

A4 

60 

141 

40 

3 

-26 

-34 

-78 

+106 

a5 

96 

141 

33 

7 

-56 

-68 

-78 

-75 

(AK)  - (Kj-Kj) 

510 

775 

233 

30 

-280 

-624 

-407 

j 

[-237 

bJ 

3797 

2475 

605 

2075 

767 

915 

940 

j * 1157 
f 

1 ~(PI  ' P!) 

0.128 

0.311 

0. 385 

0.014 

-0.682 

0.428 

-.033  J 

2 (wKi  ' 
i 

' wKi  ) » 

No  change 

(p*-v 

I 

assumed  in  ajj£ 

From  tt  to 

irj)  - 

Data  obtained  as  differences  in 

Tables  3 and  4 

.064 

0. 53 

1.85 

0. 016 

-0.87 

-0.804 

+0.82 

\ pj  • 

6.4% 

53% 

185% 

1.6% 

-87%  • 

-80.4% 

+82% 

TABLE  5 


VARIATIONS  IN  EQUIPMENT  INFLUENCE  COEFFICIENTS.  Element  Is  s(aiJK  Pj 


Change  For  (^2)  from  Tables  2 and  4 * 

* (^2  + A ^ _ f2 


T-> 

X| 

X2 

*3 

x4 

198 

116 

-53 

70 

a2 

185 

116 

-53 

72 

A3 

220 

174 

-63 

83 

a4 

103 

116 

-42 

31 

d A5 

165 

116 

-35 

72 

CD 

Kj-Kj  =AK  (Al2) 

t . S 

871 

638 

-246 

328 

1.905 

0.641 

1.000 

0.738 

Pj  \t%  to 

Assume  that  total  shift  duri 

r>g  (Af2) 

is  due  to  changes 

(aOj 

457 

995 

-246 

430 

PJ 


X5 

X6 

x7 

2 

<1 

AwKi 
PJ  1*2  ) 

-63 

-382 

-81 

-195 

-428 

-63 

-66 

-81 

+110 

-66 

-67 

-83 

-85 

+179 

-61 

-27 

-35 

-77 

+ 69 

+47 

-53 

-69 

-77 

+1 19 

- 31 

-273 

-635 

-401 

^AK  = 281 

-I(AwKi 

0.406 

0.861 

0.515 

1 

°1TK 

-672 

-737 

-778 

Y AwKi 

pr  (*2 1 
Y Akt 

t Pj  " 

- 555 

2 A Sj 

T 

I 


-5tl 


TABLE  6 


TABLE  OF  EQUIPMENT  VARIATIONS 


Fj  fixed;  changes  in  a jjy  Period  * (Ato) 


i 

J 
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condition  of  setting  all  of  the  parameter  measured  values  to  the  same 
constant  — unity.  The  criterion  value  (-540)  is  then  the  z (B?  - Bt)  . 

* J J J 

A priority  ranking  can  be  found  from  the  ratio  of  each  (Bj  - Bj)  to  the 
next  parameter  B j + ± - B j + \ . 

Table  8 gives  the  data  of  Table  7 now  with  the  cell  value  as 
the  percentage  change  which  has  occurred  in  \ ajjK  | for  the  time  change 
At2>  These  are  the  shifts  or  deterioration  in  the  operating  characteristics 
of  the  equipment  defined  for  their  influence  corresponding  to  Xj. 

Table  9 records  the  array  of  ^a^^  coefficients  for  their  values 
after  the  A t2  shift  has  occurred.  These  are  the  total  deteriorated  values 
for  each  interdependence  cell.  Of  some  significance  are  the  % change  in 
the  display  parameters  (for  pj  = 1).  These  cover  a percentage  change  from 
-18%  to  + 3.3%.  The  corresponding  changes  in  the  true  display  parameters 
with  Pj^)  actual  values  show  a range  of  -6.8%  to  +5.5%.  The  change  in 

criterion  value  Ej  bj  for  Pj  = 1 , t = t2  is  from  1 1 , 574  to  10 , 326 . The  data 
indicates  a constant  percentage  change  in  equipment  transfer  characteristics 
with  varying  system  conditions,  except  for  the  one  case  of  (J  = 4) . This 
latter  variation  would  provide  a basis  for  forecasting  a nonlinear  type  of 
deterioration  with  system  operating  conditions.  Such  a variation  does  not 
lend  itself  easily  to  malfunction  isolation,  identification,  and  corrective 
action. 

III.  CONCLUSIONS. 

The  study  is  a first  cut  at  enlarging  our  knowledge  concerning  the 
applicability  and  limitations  of  the  Observation  Matrix  Technique  for  Data 
System  Cataloging.  What  we  have  learned  from  this  study: 

1 . The  matrix  technique  will  permit  the  identification  of  variations 
in  display  integrated  parameters  (system  quantities  w^j)  and  the  individual 
contributions  due  to  each  selected  dominant  equipment  or  software  parameter. 
These  parameters  may  be  functional  in  origin  — hence  an  activity  or  action, 

2.  It  will  permit  the  separation  of  the  variations  into  changes  in 
values  of  the  influence  or  interdependence  (interface)  coefficients  or  po- 
tentials or_  changes  in  the  measured  values  of  the  parameters.  The 
parameters  can  also  be  equipment  signals  inputs  or  stimuli. 
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3.  The  test  in  which  the  parameters  are  replaced  by  a constant 
value  (normalized  to  unity)  does  prove  of  value  in  identifying  the  source 
of  deterioration  (equipment  operating  characteristics  a^K  — or  parameter 
value) . 

4.  It  permits  an  analysis  of  the  variation  as  percentage  change  in 
alJK  or  parameter  values  Pj  for  Xj. 

5 . A variation  in  the  equipment  characteristic  with  system  condition 
such  as  gain,  impedance,  etc.,  is  identified  with  stimulus,  parameter,  and 
condition. 

6.  The  matrix  technique  is  directly  applicable  to  an  optimization 
analysis  which  gives  a set  of  parameter  stimuli  values  for  comparison 
with  actual  measured  values.  This  optimized  set  {Pj}  is  the  desired 
parameter  set  for  the  several  system  conditions  or  configurations.  This 
optimized  set  may  be  the  desired  weighting  for  display  parameter  contribu- 
tions consistent  with  the  integrated  display  values  w^i  and  {gj  Xj]  . 

7.  Item  6 proposes  the  utility  of  the  matrix  and  optimization  tech- 
nique as  an  in-process  design  and  on-line  data  analysis  tool  for  selection 
of  the  distribution  of  display  contributions  for  each  equipment  or  data 
parameter  and,  hence,  selection  of  the  array  of  signal  levels  for  parameter 
central  values . 

IV.  RECOMMENDATIONS . 

1.  The  observation  matrix  technique  for  test  data  display  and 
analysis  be  subjected  to  additional  study  as  a basis  for  on-line  operator 
decision  making. 

2.  The  matrices  be  stored  in  the  same  form  as  used  for  real-time 
data  display  as  a part  of  a historical  data  bank. 

3.  Additional  design  and  utility  studies  be  directed  to  the  suggested 
techniques  as  a basis  for  ADAPT  and  competitive  DMS. 
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LETHALITY  OF  A SPECTRUM  OF  SHAPED  CHARGE  PROJECTILE 
ANTI-TANK  FIREPOWER-KILL  EFFECTS  EVALUATED  BY 
THE  AWAM-1  COMPUTER  MODEL 

Donald  F.  Haskell 

US  Army  Ballistic  Research  Laboratories 
Aberdeen  Proving  Ground,  Maryland  21005 

ABSTRACT.  This  paper  describes  an  anti-tank  lethality  study  per- 
formed by  the  use  of  AVVAM-1,  the  first  version  of  a new  armored  vehicle 
vulnerability  analysis  computer  model  developed  at  the  Ballistic  Research 
Laboratories.  In  this  study  the  vulnerability  of  a Russian  T55  tank  to 
a spectrum  of  anti-tank  shaped  charge  projectile  terminal  ballistic  effects 
is  calculated  and  analyzed.  This  is  done  to  illuminate  the  behind-armor 
effects  characteristics  and  their  combinations  that  are  most  lethal.  The 
measure  of  lethality  employed  in  this  study  is  the  probability  of  achieving 
a firepower  kill  given  a hit  on  a specific  point  on  the  tank. 

Lethality  of  the  various  projectile  effects  is  calculated  for  direct 
frontal  attack  of  the  tank.  It  is  assumed  that  there  is  an  equal  proba- 
bility of  hitting  any  point  on  the  vehicle  attack  aspect.  In  this  manner 
the  individual  projectile  effects  may  be  separated  from  weapons  systems 
hit  probability.  Typical  shaped  charge  projectile  spatial  distributions 
are  employed  for  the  mass,  speed  and  numbers  of  fragments  produced  behind 
the  armor.  The  end  product  of  the  study  is  a ranking  of  the  damage  pro- 
ducing effects  of  the  various  projectile  types  according  to  their  lethality 
in  causing  a firepower  kill.  This  was  obtained  by  a sensitivity  study  of 
the  effects  of  the  various  parameters.  The  conclusions  of  the  study  are: 

1.  Over  the  range  of  the  variables  (normalized  to  the  standard 
BRL  3.3  inch  cone  diameter  precision  shaped  charge)  from  0.6  to  1.2, 
firepower-kill  lethality  of  the  shaped  charge  jet  projectile  in  direct 
frontal  attack  of  the  Russian  T55  tank  is  most  sensitive  to  changes 

in  cone  diameter  and  the  number  and  speed  of  behind- the- armor  frag- 
ments (given  in  decreasing  order  of  influence  on  lethality  of  the 
seven  variables  studied  in  this  investigation) . 

2.  Over  the  range  of  the  standard  charge  normalized  variables 
between  1,2  and  1,8,  lethality  is  most  sensitive  to  changes  in  jet 
velocity,  cone  diameter  and  the  number  of  behind- the- armor  fragments 
(given  in  decreasing  order  of  influence) . Here  again,  lethality  is 
least  sensitive  to  jet  breakup  time. 

3.  On  the  other  hand,  jet  breakup  time  exerts  the  highest  effect 
on  lethality  over  the  standard  charge  normalized  variable  range  from 
1.8  to  2.0,  This  is  followed  by  jet  velocity  and  cone  diameter.  Mass 
of  the  behind- the- armor  fragments  has  the  least  influence  on  shaped 
charge  lethality  over  this  range. 
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4.  A marked  functional  dependence  of  upon  the  behind-the- 

armor  fragment  spray  total  kinetic  energy  was  found.  This  informa- 
tion should  be  very  useful  in  the  design  of  new  shaped-charge  anti- 
tank projectiles.  It  should  also  provide  guidance  in  the  vulnerability 
reduction  of  tanks. 


1.  INTRODUCTION.  The  object  of  this  investigation  was  to  determine 
which  of  seven  selected  shaped  charge  munition  parameters  exert  the  most 
influence  on  the  probability  of  achieving  a firepower  kill  of  the  Russian 
T55  tank  given  a random  hit  on  the  tank  in  a direct  frontal  attack.  The 
seven  parameters  are:  the  liner  cone  base  diameter,  munition  standoff 

distance  from  the  target,  jet  velocity,  and  jet  break-up  time  as  well  as 
the  number,  mass  and  speed  of  the  fragments  generated  behind  the  tank 
armor  by  the  jet.  Each  parameter  was  varied  independently  over  a pre- 
selected range  of  values  to  assess  its  effect  on  the  firepower  capability 
of  the  tank.  In  addition,  the  effects  on  kill  probability  caused  by  varia- 
tion of  combinations  of  these  parameters  was  also  studied. 

AVVAM-11  was  used  to  perform  the  study.  AVVAM-1  (Armored  Vehicle 
Vulnerability  Analysis  Model,  first  version)  is  a conceptual  model  and 
associated  digital  computer  code  developed  at  BRL  to  analytically  assess 
the  vulnerability  of  armored  vehicles.  AVVAM-1  can  be  employed  to  per- 
form both  armored  vehicle  vulnerability  and  anti- armor  weapons  design  and 
analysis  studies.  This  first  version  of  AVVAM  treats  components  and  per- 
sonnel subjected  to  penetration  and/or  perforation  damage  mechanisms. 

The  attacking  munition  may  be  a shaped  charge  or  kinetic  energy  projec- 
tile, or  a shaped  charge  or  Misznay-Schardin  land  mine.  With  additional 
effort  the  present  model  may  be  extended  to  include  other  damage  mechan- 
isms. Although  originally  developed  for  armored  vehicles,  the  code  is 
not  restricted  to  armored  vehicles  - it  may  be  employed  to  assess  the 
vulnerability  of  any  structure. 

AWAM-1  is  an  outgrowth  of  an  existing  digital  computer  code  devel- 
oped by  H.  Ege  of  the  Surface  Targets  Branch,  Vulnerability  Laboratory, 

BRL.  Ege* s code  is  based  on  relations  between  the  characteristics  of 
certain  weapons  and  vehicle  damage  observed  from  the  results  of  antitank 
tests  conducted  under  the  auspices  of  the  UK,  Canada  and  the  US  in  Canada 
during  1959. 2 


D.  F.  Haskell  and  M.  J.  Reisinger,  ’’Armored  Vehicle  Vulnerability  Analysis 
Model  - First  Version, n US  Army  Ballistic  Research  Laboratories  Interim 
Memorandum  Report  No.  85,  February  1973. 


2 

Canadian  Armament  Research  and  Development  Establishment  Report  Q-21, 
"Tripartite  Anti-Tank  Trials  and  Lethality  Evaluation  (U),M  Final  Report 
Part  I,  November  1959,  (SECRET). 
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AWAM-1  is  based  on  analytical  evaluations  of  the  damage  inflicted 
on  individual  critical  components  and  the  aggregate  effect  of  these 
damaged  components  on  compartment  and  overall  vehicle  vulnerability* 

To  do  this*  AVVAM-1  accounts  for  not  only  the  damage  inflicted  on  com- 
ponents in  the  direct  line  of  fire*  or  shotline,  of  the  attacking  muni- 
tion but  also  the  damage  inflicted  by  armor  spall  and/or  munition  fragment 
sprays  on  components  located  away  from  the  munition  shotline*  In  addi- 
tion, AWAM-1  accounts  for  the  degrading  (or  possible  enhancing)  effects 
on  the  spall  and/or  fragment  sprays  caused  by  components  positioned 
between  the  armor  and  the  critical  components.  Thus,  the  potential  pro- 
tection afforded  critical  components  by  intervening  components  is  included 
in  the  AWAM-1  calculational  procedure* 

2,  DESCRIPTION  OF  AWAM-1*  AWAM-l  is  composed  of  two  major  computer 
codes*  One  of  these  characterizes  the  target.  The  other  code  characterizes 
the  munition- target  interaction  and  performs  the  vulnerability  evaluation. 
The  target  characterization  code  describes  the  target  and  identifies, 
locates  and  determines  the  presented  area  of  critical  components.  It  also 
provides  information  concerning  components  that  are  located  between  the 
vehicle  armor  and  the  critical  components. 

To  generate  the  target  description  information,  AWAM-1  employs  the 
GIFT  (Geometric  Information  for  a Target)  code*3  The  GIFT  code  is  an 
improved  version  of  the  existing  MAGIC  code. 4 The  identification,  loca- 
tion and  presented  area  determinations  of  critical  components  and  the 
intervening  component  information  is  generated  by  a subcode  within  the 
GIFT  code  called  RIF  (Rays  Initiated  at  a Point) . 

The  second  major  code  employed  in  AWAM-1  encompasses  the  terminal 
ballistics  of  the  attacking  munition  and  the  post-plate-perforation 
characteristics  of  plate  spall  or  munition  fragment  sprays*  In  addition, 
this  second  code  calculates  the  vulnerability  of  selected  components 
within  the  vehicle  as  well  as  compartment  vulnerability  and  overall 

vehicle  vulnerability.  Because  of  its  functions,  it  is  called  the  P3  and 
3 

C PKH  (Post-Plate-Perforation  and  Component,  Compartment  and  Combat 
Vehicle  Probability  of  a Kill  given  a Hit)  code. 

In  operation,  AWAM-1  selects  critical  components  within  the  target 
and  then  evaluates  the  extent  of  damage  and  kill  probability  for  each 
selected  munition  aim  point  in  a given  view  of  the  target.  It  does  this 


3 

Lawrence  W.  Bain,  Jr.,  and  Mathew  J.  Reisinger,  ,fThe  GIFT  Code  User 
Manual,  Volume  I Introduction  and  Input  Requirements,”  BRL  Report  No, 
1802,  July  1975* 

4 

Armament  Systems,  Incorporated  and  Propulsion  Development  Department, 
"MAGIC  Computer  Simulation,”  Volumes  I and  II,  Naval  Weapons  Center 
Technical  Note  4565-5-71,  Volume  I and  Volume  II,  May  1971* 
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by  determining  the  armor  thickness  in  the  direction  of  the  shotline  of 
the  attacking  munition  and  the  number  of  intervening  components  between 
the  vehicle  armor  and  critical  component.  It  then  utilizes  the  behind- 
the-plate  characterization  of  a specific  munition  to  calculate  kill 
probabilities  given  a hit  for  all  or  selected  critical  components  within 
the  vehicle.  This  whole  process  is  accomplished  by  firing  a large  number 
of  parallel  rays  at  a given  attack  angle  and  azimuth  into  the  target. 

Each  individual  parallel  ray  then  spawns  new  rays  that  are  initiated  at 
the  munition  exit  point  on  the  armor  interior  surface.  These  new  rays 
are  used  to  search  out  the  critical  components,  define  their  position, 
shielding,  and  presented  area.  Then  the  post-plate-perforation  subcode 
converts  terminal  ballistics  input  data  into  an  expected  number  of  hits 

3 

into  each  of  the  critical  components  and  finally  the  CT  PKH  subcode 
determines  the  probability  of  a kill  of  these  components  for  the  expected 
number  of  hits.  The  kill  probabilities  for  all  the  critical  components 
within  a given  compartment  may  be  combined  into  compartment  mobility  (M) , 
firepower  (F),  and  complete  or  catastrophic  (K)  kills.  Values  for  M,  F, 
and  K kills  of  the  whole  vehicle  may  also  be  determined. 

A flow  chart  summarizing  the  operations  of  AVVAM-1  is  presented  by 
Figure  1*  In  this  figure  Box  1 represents  the  target  input.  Box  2 is 

the  RIP- section  of  the  GIFT  code.  Box  3 is  the  P3  section  and  Box  4 is 

3 3 

the  C PKH  section.  The  CT  PKH  section  provides  the  output  in  terms  of 
probability  of  a kill  given  a hit*  Also  indicated  in  the  figure' is  Box 
5 which  indicates  an  iteration  scheme  that  may  be  employed  for  multiple 
views*  Since  the  sections  represented  by  Boxes  1,  2,  3,  and  4 provide  the 
PK/H  output  for  a single  view,  results  for  multiple  views  may  be  obtained 
by  iterating  through  Boxes  2,  3,  and  4 for  each  view  desired* 

The  code  operates  as  follows:  The  particular  target  description  is 

input  through  Box  1 on  cards  and  the  specific  munition  is  input  by  cards 

3 

through  Box  3.  Information  fox  the  P section  is  handled  by  card  input. 
After  the  target  is  described  and  the  critical  components  identified, 
for  a single  vehicle  view,  RTP  selects  a starting  point  on  the  vehicle, 
fires  a main  ray  at  the  starting  point,  and  essentially  determines  the 
position,  shielding,  and  presented  area  of  all  the  critical  components 

3 

in  the  vehicle  in  relation  to  the  shotline  of  the  main  ray.  The  C PKH 
code  calls  on  the  Post-Plate-Perforation  code  to  supply  the  behind  the 
plate  spall  data  and  main  munition  shotline  information  to  include  number 
of  fragments,  size,  and  speed  of  fragments.  Next,  it  calculates  the 
expected  number  of  fragments  to  hit  a given  critical  component,  and  then 
the  probability  of  killing  that  component  given  a hit.  It  does  this  for 
each  critical  component  identified  by  the  RIP  code  for  the  particular 
shotline  selected.  All  the  critical  components  are  evaluated  for  the 
first  shotline.  The  RIP  code  then  moves  to  a new  shotline  (or  shotpoint) 
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and  the  probabilities  of  a kill  given  a hit  are  calculated  for  all  the 
components  in  the  view  of  the  new  shotline.  This  process  is  continued 
until  the  whole  view  of  the  vehicle  is  completed.  At  this  point  the  out- 
put of  the  AWAM  code  is  the  following:  Probability  of  a kill  for  each 

critical  component  in  the  vehicle,  a set  of  compartment  M,  F,  and  K kill 
probabilities  and  overall  vehicle  view  probability  of  M,  F,  and  K kill 

3 

values.  During  these  calculations  the  CTPKH  code  in  conjunction  with  the 
3 

P'  code  account  for  the  mass  and  velocity  attrition  of  the  shotline  and 
spall  fragments  as  they  perforate  intervening  components  between  the  exit 
point  on  the  armor  and  the  specific  critical  component  under  evaluation 
at  that  time. 


3.  STUDY  CONDITIONS,  As  described  previously,  the  object  of  this 
investigation  was  to  determine  which  of  the  following  parameters  has  the 
most  effect  on  the  firepower  kill  lethality  of  a shaped  charge  projectile 
used  in  a direct  frontal  attack  of  the  T55  tank:  liner  cone  base  diameter, 

standoff  distance,  jet  velocity,  jet  breakup  time,  and  the  number,  mass 
and  speed  of  the  behind- the- armor  fragments  produced  by  the  jet.  Each  of 
these  parameters  was  varied  independently  over  a selected  range  of  values 
and  AVVAM-1  was  used  to  calculate  and  follow  the  attendant  variation  in 
firepower  kill  probability  given  a hit  on  the  tank  (Pj^)  - The  effect  on 

PK/H  certa:*'n  combinations  of  parameters  was  also  calculated.  To  do  this, 

equal  variations  in  each  of  the  parameters  in  the  combination  were  employed. 


Those  parameters  used  to  characterise  the  shaped  charge  jet  itself 
were  varied  over  the  following  range  of  values: 

. Liner  cone  base  diameter  = 0. 0S08  m -0. 2032  m 


. Munition-target  standoff  distance  = 0.08382  m -2.09S5  m 

. Jet  velocity  = 2,000  m/sec  -20,000  m/sec 

Jet  break-up  time  - 25  microsecs  -200  microsecs 

The  behind -the -armor  fragment  characteristics  produced  by  the  above  range 
of  conditions  were  automatically  calculated  internally  by  AWAM-1.  In 
addition,  other  calculations  were  made  in  which  the  behind- the- armor 
fragment  characteristics  (number  of  fragments,  fragment  mass  and  fragment 
speed)  were  varied  independently  of  the  normally  expected  values  that 
would  otherwise  be  governed  by  the  jet  parameters.  In  these  cases,  the 
number  of  fragments,  their  mass  and  speed  were  made  proportional  to  those 
produced  by  a standard  shaped  charge  and  were  varied  over  the  range  from 
0.01  to  10  times  the  particular  standard  charge  behind- the- armor  charac- 
teristic value.  That  is,  the  number  of  fragments- to- number  of  fragments 
from  the  standard  charge  ratio,  fragment  mass- to- standard  charge  fragment 
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mass  ratio,  and  the  fragment  speed- to- standard  charge  fragment  speed 
ratio  were  varied  from  0.01  to  10. 

The  standard  shaped  charge5  is  a 42° , apex  angle,  copper  cone  loaded 
with  octol  explosive  in  a light  confining  wall  of  aluminum.  A diagram  of 
this  charge  is  shown  in  Figure  2.  All  linear  dimensions  in  the  diagram 
are  in  inch  units.  It  will  be  referred  to  as  the  "standard  charge."  The 
nominal  liner  wall  thickness  is  .081  inch  (,206  mm)  and  the  outside 
diameter  at  the  base  of  the  cone  is  3,3  inches  (S4  mm).  The  total  liner 
weight  is  .61  lb  (.277  kg),  the  explosive  weight  is  1.93  lbs  (.875  kg), 
and  the  aluminum  body  weight  is  1,14  lbs  (.517  kg).  The  standard  cone 
diameter  and  standoff  as  well  as  the  jet  constants  of  this  standard  charge 
obtained  experimentally  from  flash  radiographs  are  as  follows: 


Cone  diameter 
Standoff 
Jet  velocity 
Break-up  time 


0.0832  m 
0.16764  m 
8,300  m/sec 
103  microseconds 


The  T55  tank  was  assumed  to  be  fully  combat  loaded  and  directly 
attacked  in  the  front  (zero  azimuth  and  elevation)  by  the  shaped  charge 
projectile.  A planar  gridwork  of  60.96  cm  square  cells  was  erected  over 
the  front  of  the  tank  normal  to  the  attack  direction.  A shotline*  to 
simulate  a projectile  flight  path,  was  fired  in  the  attack  direction  at 
a random  point  within  each  cell.  In  this  manner  the  projectile  lethality, 
measured  by  the  firepower  P^^,  was  evaluated  for  strike  points  over  the 

whole  front  of  the  tank.  Then  these  individual  cell  (or  shotline)  P^'s 
were  averaged  together  to  obtain  a single  firepower  kill  P^H#  represen- 
tative of  the  particular  set  of  parameter  values  being  evaluated. 


The  GIFT  description  of  the  T55  tank  consisted  of  approximately  630 
components.  Of  this  total  about  400  were  considered  critical  components. 
About  half  of  these  were  considered  critical  to  the  tank's  firepower. 

In  this  AVVAM-1  analysis,  5 rays  to  simulate  behind- the- armor  fragments 
were  fired  at.  each  of  these  200  firepower  critical  components. 

4,  RESULTS  AND  DISCUSSION.  The  study  results  obtained  by  use  of 
AVVAM-1  are  illustrated  in  Figures  3 through  14  and  Tables  I and  II. 
Figures  3 through  6 show  the  effects  on  average  firepower  kill  V^/\\  of 

independent  variations  in  the  basic  shaped  charge  projectile  jet  charac- 
terization parameters.  In  these  figures  the  ordinate  corresponds  to 

and  the  abscissa  corresponds  to  the  jet  parameter.  The  jet  parameters  are 


"R.  DiPersio,  J.  Simon  and  A.  B.  Merendino,  "Penetration  of  Shaped^Charge 
Jets  Into  Metallic  Targets,"  BRL  Report  No.  1296,  September  1965,  pp.  16-17. 
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displayed  as  a ratio  of  the  actual  parameter  value-to-the  value  of  that 
parameter  exhibited  by  the  "standard  charge."  For  example,  in  Figure  3, 
at  the  abscissa  value  equal  to  2,  the  cone  diameter  under  investigation 
is  twice  the  size  of  the  standard  charge  cone  diameter,  or  ,0168  m 
(6.6  in.).  Each  parameter  was  varied  while  the  remaining  parameters  were 
kept  constant  at  the  value  exhibited  by  the  standard  charge.  In  the  cone 
diameter  case  the  standoff  was  maintained  equal  to  that  of  the  standard 
charge  i.e.,  equal  to  two  cone  diameters. 


As  to  be  expected,  the  figures  show  that  average  increases  with 

increasing  cone  diameter,  jet  velocity  and  breakup  time  and  that 
decreases  as  standoff  is  increased.  The  P^yH  and  its  slope  vary  contin- 
uously with  cone  diameter.  On  the  other  hand,  both  and  its  slope 

vary  irregularly  (almost  discontinuously)  with  jet  velocity  and  break- 
up time.  Figure  5 depicts  a plateau  in  over  the  jet  velocity-to- 

standard  charge  jet  velocity  ratio  from  0.6  to  1.3.  A wider  plateau  exists 
in  the  relationship  between  anc*  breakup  time  as  shown  by  Figure  6. 

In  this  case  the  plateau  extends  from  breakup  time- to- standard  charge 
breakup  time  ratio  equal  to  0,4  to  approximately  1,6. 

The  relationship  between  Pj^  and  standoff  illustrated  in  Figure  4 

is  interesting.  As  to  be  expected,  P^H  decreases  with  standoff  over  the 

range  studied.  However,  this  decrease  is  less  than  would  be  expected- 
the  functional  dependence  of  on  standoff  is  quite  weak.  It  is  much 

less  than  it  is  with  the  other  jet  parameters.  The  jet  lethality  is  not 
significantly  degraded  at  standoffs  much  greater  than  standard. 


Figure  7 illustrates  the  effect 


on  PK//H  °f  simultaneous  and  equal 


variation  of  the  jet  parameters.  As  before,  the  abscissa  represents  the 
value  of  the  parameter  relative  to  its  value  exhibited  by  the  standard 
charge.  Although  here  it  represents  the  value  of  all  four  jet  parameters 
relative  to  the  standard  charge  case.  For  example,  at  the  parameter 
value-to-parameter  value  of  standard  charge  ratio  equal  to  0.5,  the  P^H 

computation  was  performed  with  the  cone  diameter,  standoff,  jet  velocity 
and  jet  breakup  time  all  set  equal  to  one-half  the  parameter  values  cor- 
responding to  the  standard  charge.  As  indicated  by  the  figure,  is  a 


smooth,  increasing  function  of  the  four  jet  parameters.  Its  rate  of 
increase  with  these  four  parameters  is  higher,  as  to  be  expected,  than 
its  variation  with  change  in  the  parameters  individually,  although  the 
combined  effect  is  not  the  sum  of  the  effects  of  the  individual  parameters. 
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Figure  8 shows  the  effect  of  individual  and  combined  change  in  the 


behind^ the- armor  fragment  parameters  on  P 


The  functional  relation- 


ship between  and  the  fragment  parameters  is  similar  for  all  three: 

number  of  fragments,  fragment  mass  and  speed*  P^H  increases  rapidly  as 

the  fragment  parameter- to- standard  charge  parameter  value  ratio  increases 
from  zero  to  1 or  2.  Beyond  this  region,  the  P^H  rise  levels  off.  As 

indicated,  equal  variation  of  all  three  fragment  parameters  at  the  same 
time  yields  higher  than  individual  variation  of  these  parameters. 

However,  the  combined  effects  do  not  show  as  high  an  increase  over  the 
individual  fragment  parameter  variations  as  may  be  expected  beforehand. 

The  slopes  of  the  curves  from  Figures  3 to  6 and  8 in  which  the  jet  and 
fragment  parameters  are  varied  independently  are  plotted  in  Figures  9 and 
10.  Figures  9 and  10  exhibit  the  sensitivity  to  the  various  para- 

meters over  a range  of  these  parameters.  As  indicated,  there  is  no  single 
parameter  that  has  the  most  influence  on  over  the  range  of  independ- 

ent parameters  shown.  Between  normalized  variable  from  0.6  to  about  1.2, 
is  most  sensitive  to  cone  diameter.  Above  1.2  and  below  about  1.8 

jet  velocity  has  the  most  effect  on  P^^.  Between  normalized  variable 

equal  to  1,8  and  2*0,  the  jet  velocity  displays  the  highest  influence  on 
PK/H'  et^ect  standoff,  as  described  earlier,  is  minimal  over  the 

whole  range.  In  regard  to  the  fragment  effects,  the  number  and  speed  of 
the  fragments  show  about  the  same  influence  on  P^H  which,  over  most  of 

parameter  range  shown,  is  considerably  higher  than  the  fragment  mass 
influence. 


The  slopes  of  the  curves  in  Figures  9 and  10  over  the  range  of  para- 
meter -to-  standard  charge  ratio  from  0.6  to  2.0  at  discrete  points  within 
this  range  are  listed  in  Table  I.  The  slopes  are  normalized  to  that  of 
the  cone  diameter.  In  this  manner  those  parameters  that  have  higher,  or 
lower,  influence  on  P^H  than  the  cone  diameter  are  easily  distinguished* 

Table  II  lists  the  parameters  according  to  their  decreasing  order  of 
influence  on  P K over  the  variable  value  normalized  to  standard  charge 


range  from  .6  to  2.  As  indicated,  cone  diameter,  jet  velocity  and  jet 
breakup  time  exhibit  the  highest  influence  on  P^^  over  the  range  of 

variables  shown. 


Figures  11  to  14  are  log- log  plots  of  versus  behind- the- armor 

fragment  total  kinetic  energy.  These  figures  are  included  here  to  illus- 
trate the  relationship  between  behind- the- armor  fragment  kinetic  energy 
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and  average  kill  probability.  Figures  11,  12  and  13  show  the  calculation 
results  for  independent  variations  in  the  fragment  parameters.  In 
Figure  11  the  total  fragment  kinetic  energy  was  increased  by  increasing 
the  number  of  fragments  with  their  mass  and  speed  held  constant  and  equal 
to  the  mass  and  speed  exhibited  by  the  standard  charge.  In  Figure  12 
the  mass  of  the  fragments  was  varied  while  the  number  and  speed  were  main- 
tained equal  to  those  characteristic  of  the  standard  charge.  The  fragment 
speed  was  varied  in  Figure  13,  and  in  Figure  14  all  three  parameters  were 
varied  simultaneously  and  equally.  The  lines  in  these  figures  were  drawn 
in  by  "eye."  These  figures  indicate  that  there  appears  to  be  a definite 
relationship  between  behind- the- armor  fragment  total  kinetic  energy  and 
resultant  P^^.  In  each  figure,  P^^  increases  with  total  fragment 

kinetic  energy.  Furthermore,  there  are  two  distinct  regions  of  fragment 

4 5 

influence,  with  kinetic  energy  of  the  order  of  10  to  10  joules  as  the 
demarcation  zone  between  these  two  regions.  Fragment  kinetic  energy  has 
a much  larger  effect  on  in  the  lower  kinetic  energy  region  than  in 

the  higher  region. 


CONCLUSIONS. 


1.  Over  the  range  of  the  variables  (normalized  to  the  standard 
BRL  3.3  inch  cone  diameter  precision  shaped  charge)  from  0.6  to  1.2, 
f irepower-kil 1 lethality  of  the  shaped  charge  jet  projectile  in  direct 
frontal  attack  of  the  Russian  T55  tank  is  most  sensitive  to  changes  in 
cone  diameter  and  the  number  and  speed  of  behind- the- armor  fragments 
(given  in  decreasing  order  of  influence).  Over  this  same  range,  jet 
breakup  time  exerts  the  least  influence  on  lethality  of  the  seven  vari- 
ables studied  in  this  investigation. 


2.  Over  the  range  of  the  standard  charge  normalized  variables 
between  1.2  and  1.8,  lethality  is  most  sensitive  to  changes  in  jet  veloc- 
ity, cone  diameter  and  the  number  of  behind- the- armor  fragments  (given 

in  decreasing  order  of  influence) . Here  again,  lethality  is  least  sen- 
sitive to  jet  breakup  time. 

3.  On  the  other  hand,  jet  breakup  time  exerts  the  highest  effect 
on  lethality  over  the  standard  charge  normalized  variable  range  from  1.8 
to  2.0.  This  is  followed  by  jet  velocity  and  cone  diameter.  Mass  of 
the  behind-the-armor  fragments  has  the  least  influence  on  shaped  charge 
lethality  over  this  range. 


4. 


A marked  functional  dependence  of  P 


K/H 


upon  the  behind-the- 


armor  fragment  spray  total  kinetic  energy  was  found. 
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Figure  1.  AVVAM-l  Code  Summary  Flow  Chart 


CONE  HEIGHT  * 3.84 


Figure  2.  Diagram  of  Standard  Shaped  Charge 
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Figure  7*  Effect  of  Simultaneous  and  Equal  Variation  in  Cone 
Diameter,  Standoff,  Jet  Velocity  and  Jet  Breakup 
Time  on 
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VARIABLE  NORMALIZED  TO  STANDARD  CHARGE 


Figure  9.  Sensitivity  to  Shaped  Charge  Variables 


205 


Figure  10.  Sensitivity  to  Fragment  Variables 
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I 


207 


in  Number  of  Fragments 


TOTAL  FRAGMENT  KINETIC  ENERGY,  joules 


Figure  14,  ^K/H  Variation  with  Total  Fragment  Kinetic  Energy  for 

Simultaneous  and  Equal  Variations  in  Fragment  Number, 
Mass  and  Speed 


Table  I.  P 


Sensitivities  Relative  to  Cone  Diameter 


K/Il 


VARIABLE 

.6 

.8 

1 

■ 

1.4 

1.6 

1.8 

2.0 

CONE  DIA. 

1 

1 

1 

1 

1 

1 

1 

1 

STANDOFF 

.005 

-.008 

-.024 

-.057 

-.079 

-.174 

-.203 

-.217 

JET  VELOCITY 

.172 

0 

0 

0 

6,263 

2.333 

1,855 

1.681 

BREAKUP  TIME 

0 

0 

0 

0 

0 

0 

2.203 

14.493 

NUMBER  OF  FRAGS. 

.211 

*236 

.246 

. 3S9 

.351 

.435 

,319 

.304 

MASS  OF  FRAGS. 

.072 

,098 

,114 

,156 

.316 

.261 

.145 

*058 

FRAGMENT  SPEED 

.187 

* 226 

*237 

.286 

,202 

.319 

.261 

.116 
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Table  II.  P 


Sensitivities  in  Descending  Order  of  Influence 


K/H 


INFLUENCE 

VARIABLE  VALUE 

NORMALIZED  TO  STANDARD 

CHARGE 

* 6 

.8 

1 

1.2 

EDI 

1*8 

2 

HIGHEST 

CD 

CD 

CD 

CD 

VJET 

VJET 

Tl 

Tl 

NF 

NF 

NF 

NF 

CD 

CD 

VJET 

VJET 

SF 

SF 

SF 

SF 

NF 

NF 

CD 

CD 

VJET 

MF 

MF 

MF 

MF 

SF 

NF 

NF 

MF 

S 

S 

S 

SF 

MF 

SF 

S 

S 

VJET 

VJET 

VJET 

S 

S 

S 

SF 

LOWEST 

Tl 

Tl 

Tl 

Tl 

Tl 

Tl 

MF 

MF 

NOTE: 

CD:  Cone  diameter 

VJET:  Jet  velocity 

Tl:  Jet  breakup  time 

S:  Shaped  charge  standoff  distance 

NF:  Number  of  behind- the- armor  fragments 

SF:  Speed  of  behind- the- armor  fragments 

MF:  Mass  of  behind- the- armor  fragments 
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DEVELOPMENT  AND  OPTIMIZATION  OF  SIGNAL  PROCESSING 
UTILIZED  IN  A MINE  DETECTION  SYSTEM 

Abram  Leff 

U.S.  Army  Mobility  Equipment  R&D  Command 
Fort  Bel voir,  Virginia  22060 


ABSTRACT.  Based  upon  an  extensive  R&D  program  a broadband  microwave 
technique  has  evolved  which  can  be  utilized  to  rapidly  scan  roads  and  reliably 
detect  both  metal lic/non-metallic  cased  AT  mines.  In  order  to  optimize 
mine  detection  capability  and  minimize  false  alarms,  processing  algorithms 
had  to  be  developed  and  evaluated.  Field  data  was  initially  recorded  on 
analog  tape  for  off-site  processing.  This  data  was  utilized  to  develop 
potential  processing  algorithms. 

To  permit  the  evaluation  of  algorithms  in  real-world  conditions,  a 
mobile  real-time  feasibility  system  was  designed  and  fabricated.  This 
system  incorporated  a CDC  469  minicomputer  which  allowed  storage  in 
memory  of  up  to  six  algorithms,  as  well  as  the  control  programs  for  the 
system.  Flexibility  incorporated  in  the  design  and  the  addition  of  a 
keyboard  unit  provided  the  capability  to  utilize  any  one  of  the  six 
algorithms  or  vary  most  test  parameters  in  the  field,  by  simply  changing 
an  address  in  the  System  Control  Unit  or  CPU. 

This  paper  will  discuss  utilization  of  a minicomputer  in  the  development 
of  an  optimized  processing  algorithm  for  mine  detection. 

1.  INTRODUCTION.  Considerable  data  processing  and  analysis  effort  has 
been  expended  with  the  objective  of  developing  target  discrimination  techniques 
for  detecting  road  mine  responses  in  the  presence  of  normal  background 
responses.  The  goal  of  this  effort  was  the  development  of  field-operable 
software  that  optimizes  the  probability  of  detecting  AT/AV  mines  under 
field  conditions  while  concurrently  minimizing  the  false  alarm  rate. 

The  inherent  nature  of  the  data  analysis  problem  dictates  a tradeoff 
between  the  probability  of  target  detection  vs.  the  false  alarm  rate. 

This  relationship  in  a specific  situation  depends  upon  the  particular 
field  conditions  encountered;  therefore,  large  amounts  of  test  data  had  to  be 
gathered  from  a considerable  variety  of  test  site  conditions  in  order  to 
optimize  detection  vs.  false  alarms. 

Initially,  field  data  was  recorded  on  a four  channel  analog  tape  recorder 
and  was  off-line  converted  to  digital  data  format  and  recorded  on  IBM-compatible 
magnetic  tape  which  then  could  be  played  back  in  various  ways  to  produce  plots 
of  the  desired  data  parameters.  This  digitized  data  also  formed  the  basis 
for  the  development  of  the  processing  algorithms  by  means  of  computer 
analysis. 
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2.  STATEMENT  OF  PROBLEM.  To  attack  the  basic  data  analysis  problem, 
which  was  to  develop  a field-operable  software  system  maximizing  target 
detection  while  minimizing  false  alarms,  a methodical  investigation  was 
initiated  to  (a)  define  the  general  target  responses,  (b)  select  the  most 
favorable  processing  methods,  (c)  find  the  best  way  to  represent  the  data, 

(d)  identify  the  decision  rules  for  discrimination  between  mine  signature 
and  background  data,  and  (e)  minimize  the  computational  complexity  needed  to 
perform  the  task. 

Based  upon  the  results  of  the  various  analysis  and  processing  schemes 
which  were  applied  to  test  data,  it  was  concluded  that  two  initial  processes 
were  essential:  (a)  calculation  of  a background  estimate  based  on  preceding 
data  inputs,  and  (b)  normalization  of  the  input  data  with  respect  to 
the  background  estimate.  The  latter  is  required  because  of  the  large 
differences  in  absolute  amplitude  levels  in  the  various  frequency  channels 
caused  by  frequency  roll-off  of  the  antennas,  and  because  of  different 
attenuation  levels  in  the  soil. 

3.  ALGORITHM  DEVELOPMENT.  The  first  useful  algorithm  developed  was 
named  Target  Amplitude  Descriptor,  or  TAD,  where  for  each  channel  and  at 
each  consecutive  position  the  ratio  between  the  input  data  and  the  background 
estimate  was  established,  after  which  these  ratios  were  averaged  for  all 
channels  of  a particular  antenna  pair. 

Logarithmic  TAD  plots  were  made  from  the  recorded  test  data  using 
the  amplitude  information  R-jj  (ratio  between  receive  and  transmit  signal 
levels  in  dB  as  measured  at  position  i and  frequency  j)  in  the  following 
manner: 

j=n 

Log  TAD,  = 20  log  { 1 l A^/B..  } (in  dB) 

n . , 'J  >J 
J=1 

where:  A-  = linear  value  of  R-  (by  using:  A^  = 10  Rij/20) 

n = number  of  frequency  channels 

B-j,  = linear  value  of  background  estimate  at  position  i and 
frequency  j,  determined  as  follows: 

Bij  = 0 “ a ) xBi.lfj  + a x 

where  a is  a constant  chosen  such  that  the  background  estimate  relates  to  a 
selected  distance: 
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-2AXj 

a - 1 - e"0®^ 

where  AX^  = position  increment  in  meters 

"Decay"  can  be  defined  also  as  the  trailing  distance  where  the  relative 
weight  of  the  background  contribution  is  equal  to  13.5%.  This  process 
is  known  as  exponential  smoothing  (i.e.  gives  greater  weight  to  most 
recent  data),  with  the  decay  analogous  to  two  time  constants  in  an  R-C 
filter  network.  Expressed  in  terms  of  a: 

2AXj 

Decay  = " In  (1-a) 

Initially  all  mines  responses  were  differentiated  on  TAD  plots  because 
the  target  data  was  not  excluded  from  the  background  estimate,  in  essence 
reducing  the  target  responses.  To  correct  this,  a threshold  condition 
was  initially  inserted  in  the  determination  of  the  weighting  factor  a. 

In  test  areas  with  large  background  fluctuations,  however,  the  background 
peaks  would  also  be  excluded  from  the  background  estimate  when  the 
threshold  was  exceeded. 

Average  (ASUM)  although  originally  developed  as  a modifier  for  the 
weighting  factor  to  dynamically  exclude  target  responses  from  the  background 
estimate,  the  process  designated  ASUM  proved  to  be  a most  successful 
algorithm.  ASUM  is  defined  as  follows: 

n 

ASUM,  = 1 E An,j 
j=l 

where : 

n = number  of  frequency  channels 

An^j  = the  normalized  data  at  position  i and  frequency  j 


aij 


linear  value  of  amplitude  data  for  position  i and  frequency  j 
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B-jj  = exponentially  smoothed  background  estimate  for  position  i and 
frequency  j . 

a-.  = standard  deviation,  obtained  from  the  exponentially  smoothed 
J estimate  of  the  variance. 

and  a—  are  determined  by: 

Bij  * Bi-l,j  (1-a)  + ^i-l.j 

°ij2  = °i-l,j  (1"a)  + a(Ai-U  ' Bi-1,/ 

where  the  weighting  factor  a was  initially: 

. -2AXj/decay 
ct^  = 1-e  ' 

where 

AXi  = position  increment  between  samples  at  i and  i-1 

Decay  = the  distance  prior  to  position  i where  the  relative  weight  of 
the  contribution  to  the  background  estimate  is  down  to  13%. 

The  weighting  factor  was  later  modified  to: 

ay,  if  ASUMi  < 0.9 
a2  ^ 0;  otherwise 


and  finally  to: 

4 

ot3  = ot1/(  1 + ASUMi  ) 

The  weighting  factor  was  modified  in  order  to  compensate  for  the 
effects  of  varying  background  levels  as  a function  of  soil  conditions  and 
moisture  conditions. 

Many  more  attempts  were  made  to  further  improve  target  discrimination 
by  evaluating  statistical  methods  and  methods  using  magnitude  and/or 
duration  of  target  responses.  As  a result,  six  algorithms  were  developed 
for  evaluation  in  real-time,  real-world  field  tests  with  the  feasibility 
model . 
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4.  FEASIBILITY  MODEL.  To  provide  real-time  test  results  and 
flexibility  in  the  selection  of  test  parameters  (position  and  frequency 
sample  intervals,  frequency  range,  etc.),  a mine  detection  feasibility 
model  incorporating  advanced  discrimination  techniques  was  designed 
and  fabricated.  (See  block  diagram,  Figure  4-1).  A programmable 
minicomputer  (CDC  Model  649)  was  incorporated  into  the  system  to 
increase  the  data  processing  capabilities,  an  X-Y  recorder  was  added 
to  provide  real-time  plots  of  test  results,  and  a four-channel  analog 
instrumentation  recorder  was  employed  to  record  the  test  data  for  off-site 
processing  and  analysis. 

This  feasibility  system  was  installed  on  an  electrically  powered 
tractor  (G.E.  Model  E-15)  as  shown  in  Figure  4-2.  Field  tests  were 
conducted  (and  continue  to  be  conducted)  at  various  field  sites,  with 
the  purpose  of  evaluating  the  real-time  discrimination  techniques 
that  were  previously  developed  utilizing  an  off-line  system. 

The  electronic  components  of  the  feasibility  model  consist 
primarily  of  a transmitter,  a receiver,  a System  Control  Unit  (SCU)  and 
the  data  processing  subsystem  (CDC  469).  As  described  in  the 
subparagraphs  that  follow,  the  receiver  compares  the  phase  and  amplitude 
of  received  signals  to  the  transmitted  signal.  The  resulting  phase 
and  amplitude  data  are  examined  by  the  data  processing  subsystem  for 
indications  of  the  presence  of  a mine  under  the  detector  heads. 

4.1  FEASIBILITY  SYSTEM  OPERATION.  The  major  system 
components  — transmitter,  receiver,  system  control  unit,  processor 
and  power  supplies  — are  housed  in  an  equipment  enclosure  which  mounts 
to  the  front  of  the  tractor.  The  antenna  switches  mount  near  the 
search  head  at  the  end  of  the  frame,  and  the  shaft  encoder  that 
correlates  mine  detector  data  to  search  head  position  and  operating 
speed  is  driven  by  a rear  wheel  on  the  tractor. 

The  System  Control  Unit  (SCU)  responds  to  synchronizing  pulses 
from  the  position  encoder  and  control  commands  generated  via  the 
keyboard  unit,  producing  sweep  control  signals  for  use  by  the 
transmitter.  The  transmitter  signal  originates  in  a voltage-controlled 
oscillator  (VCO)  whose  output  frequency  is  proportional  to  input 
sweep  voltage.  The  VCO  output  is  power-amplified  to  approximately 
1 watt  and  supplied  via  a directional  coupler  and  solid-state 
switch  to  the  transmit  antennas,  with  a -20  dB  output  of  the  coupler 
supplied  as  a reference  to  the  receiver. 

The  receive  antennas  connect  via  another  solid-state  switch  to 
the  receiver.  The  SCU  synchronously  controls  the  two  antennas  switches, 
providing  the  scanning  sequence  that  selects  each  transmit-receive 
antenna  pair  in  turn.  The  receiver  contains  a synchronous  detector 
that  yields  an  amplitude  output  and  a phase  detector  that  yields  the 
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phase  data  signal.  Receiver  outputs  are  digitized  and  processed  by  the 
SCU  and  CPU. 


4.1.1  SYSTEM  CONTROL  UNIT.  In  addition  to  generating  scan 
sequence  commands  in  response  to  position  encoder  pulses,  and  generating 
sweep  control  voltages  and  antenna  switching  commands,  the  SCU  processes 
amplitude  data  supplied  by  the  receiver,  detects  target  responses  in  the 
processed  data,  and  activates  the  target  alarm  and  identifies  the  position 
of  a detected  target.  In  particular,  the  SCU  produces  control  signals 
for  system  operation. 

Various  computer  operational  commands  and  any  one  of  the  processing 
algorithms  stored  in  the  computer  memory  can  be  addressed  via  the  SCU 
by  means  of  appropriate  keyboard  entries.  The  keyboard  is  also  employed 
to  set  up  the  SCU  control  circuits  which  establish  the  test  parameters 
for  a particular  operation,  as  follows: 

(1)  Frequency  Increment  - Amplitude  data  supplied  by  the  receiver  is 
sampled  at  intervals  as  the  frequency  range  is  swept,  and  the  frequency 
change  during  a sample  is  defined  as  the  frequency  increment.  The 
frequency  increment  is  selectable  over  a range  from  10  MHz  to  150  MHz  in 
multiples  of  10  MHz. 

(2)  Start  Frequency  - The  start  frequency  is  defined  as  the  frequency 
chosen  for  the  low  end  of  the  swept  frequency  range,  and  is  selectable  in 
multiples  of  10  MHz  from  300  to  990  MHz. 

(3)  Sample  Interval  - The  sample  interval  is  a measure  of  the 
duration  between  each"  frequency  sample,  and  is  selectable  over  a range 
from  0.1  ms  to  1.5  ms  in  multiples  of  0.1  ms. 

(4)  Number  of  Sample  Intervals  - The  number  of  sample  intervals  per 
sweep  (up  or down)  is  selectable  over  a range  from  1 to  15  intervals  per 
sweep. 

(5)  Transition  Time  - When  a sweep  reaches  the  end  frequency  (either 
high  or  low) , it  dwel Is  at  this  frequency  for  a selected  time  period  to 
allow  the  antenna  switches  to  select  the  next  antenna  pair.  This  transition 
time  is  selectable  from  0.1  to  1.5  ms  in  multiples  of  0.1  ms. 

(6)  Number  of  Sweeps  - The  number  of  sweeps  per  scan  normally 
corresponds  to  the  number  of  antenna  pairs  in  the  search  head  array,  thus 
allowing  one  sweep  per  pair.  The  number  of  sweeps  per  scan  is  selectable 
over  a range  from  1 to  16. 

(7)  Scan  Rate  - The  scan  rate  is  the  number  of  complete  scans 
(sweeping  all  antenna  pairs)  per  unit  of  travel.  The  selectable  rates  are 
25,  50,  100  and  200  scans/meter. 
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4.1.2  CENTRAL  PROCESSOR  UNIT  (CPU).  The  processor  employed  in  the 
feasibility  model  is  a CDC  Model  469  minicomputer  with  8K  words  of  plated 
wire  memory.  Word  length  is  16  bits,  and  the  cycle  time  is  1 ys.  There 
are  42  instructions,  and  the  CPU  responds  to  direct,  indexed  direct  and 
indirect  addressing  modes. 

The  function  of  the  processor  is  to  analyze  the  digitized  data 
supplied  by  the  SCU  and  determine,  with  a high  degree  of  certainty,  whether 
a received  response  is  caused  by  a mine  or  some  other  anomaly  in  the  ground. 

If  the  presence  of  a mine  is  indicated,  an  output  signal  is  generated 
for  use  in  sounding  an  alarm. 

The  8K  memory  in  the  processor  stores  and  protects  all  the  processing 
algorithms,  and  also  contains  the  operational  programs  for  the  SCU. 

Thus,  by  means  of  the  control  keyboard,  any  of  the  various  algorithms  can 
be  selected,  and  parameter  changes  to  both  processing  algorithms  and 
SCU  programs  can  be  entered  at  any  time,  as  desired. 

Through  use  of  the  computer  peripherals,  the  contents  of  the  memory 
can  be  changed  as  desired  by  connecting  the  system  to  the  programmer's 
console.  Changes  can  then  be  entered  using  magnetic  tape,  paper  type,  or 
via  the  teleprinter.  A monitor  oscilloscope  is  available  for  displaying 
the  contents  of  the  memory. 

5.  CONCLUSIONS.  Utilizing  the  feasibility  model,  a test  program 
was  performed  in  order  to  evaluate  the  developed  algorithms  as  well  as 
several  detector  head  configurations.  Six  algorithms  were  stored  in 
memory  in  the  CDC  469  minicomputer  and  could  be  selected  with  the  keyboard, 
for  operation  in  the  system.  Nominal  default  values  were  included  for  the 
variable  test  parameters  such  as  decay  and  position  increment  as  well  as 
for  threshold  values  or  conditions  applicable  to  certain  algorithms.  All 
these  could  be  changed  by  addressing  a defined  location  in  the  CPU  by 
means  of  the  keyboard.  In  addition,  two  versions  of  ASUM  were  included 
to  establish  accuracy  requirements:  one  in  floating  point  and  one  in 
6-place  fixed  point.  The  latter  provided  insufficient  accuracy  and  was 
later  dropped. 

From  the  start  it  was  apparent  that  performance  of  any  algorithm  using 
one  or  more  thresholds  hinged  on  the  margin  over  which  these  threshold 
values  could  be  used  at  any  location  and  with  any  soil  composition  or 
moisture  content.  Obviously,  under  operational  conditions,  it  would  not 
be  feasible  to  obtain  a specific  threshold  value  for  the  encountered  soil 
conditions.  One  could  hardly  be  expected  to  operate  over  a road  to 
establish  the  required  threshold  value,  then  go  back  over  it  to  detect 
any  possible  mines. 

After  a lengthy,  time  consuming  series  of  tests  at  several  test  sites, 
it  was  found  that  threshold  values  varied  widely  with  different  soil  conditions. 
With  the  need  for  a priori  knowledge  of  soil  conditions  established,  the 
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further  use  of  threshold  dependent  algorithms  were  abandoned  for  the 
practical  reasons  stated  above. 

Real  Time  field  evaluation  of  the  TAD  and  ASUM  algorithms  indicated 
that  best  performance  could  be  achieved  with  ASUM  (floating-point  version). 
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FEASIBILITY  SYSTEM  BLOCK  DIAGRAM 
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Figure  4-2:  Feasibility  Model.  Mine  Detection  System 


AUTOMATED  CONTROL,  DATA  ACQUISITION,  AND  ANALYSES 
FOR  HYDRAULIC  MODELS  OF  TIDAL  INLETS 

D.  L.  Durham;  H.  C.  Greer,  III;  and  R.  W.  Whalin 
U.  S.  Army  Engineer  Waterways  Experiment  Station 
P.  0 . Box  631,  Vicksburg,  Miss.  39180 

ABSTRACT.  An  Automated  Data  Acquisition  and  Control  System  (ADACS) , 
which  was  developed  (Durham  and  Greer,  1975)  at  the  Waterways  Experiment 
Station  (WES)  during  the  past  two  years,  has  been  expanded  to  provide 
automated  control,  data  acquisition  and  analyses  for  hydraulic  models 
of  tidal  inlets.  ADACS  configuration  consists  of  a minicomputer  with 
32K  16-bit  words  of  memory,  an  interval  timer  (1  ysec),  an  analog  to 
digital  12-bit  converter  with  64  analog  inputs  (+_  10  volts)  and  45  kHz 
multiplexer,  96  sense/control  lines,  a magnetic  tape  controller  with 
two  9- track  tape  drives,  a moving  head  disk  controller  with  one  dual 
disc  drive  (removable  and  non-removable  platters),  one  matrix  electro- 
static printer/plotter,  and  an  ASR  33  teletype  unit. 

ADACS  controls  the  hydraulic  generation  of  the  tide  in  the  tidal 
inlet  model  by  providing  a programmable  analog  voltage  to  the  hydraulic 
tide  generator.  The  programmable  tide  controller  can  simulate  a tide 
composed  of  one  to  N tidal  constituents  for  as  many  tidal  cycles  as 
required.  Model  tidal  elevations  are  recorded  by  ADACS  using  a bubbler 
system  which  measures  small  hydrostatic  pressure  changes  associated  with 
changes  in  tidal  elevations  in  the  model.  The  bubbler  system  consists 
of  a high  precision  pressure  transducer,  a scanivalve  device  for  sequencing 
input  ports,  and  48  pressure  inputs.  The  pressure  transducer  can  be 
calibrated  prior  to  and  at  selected  time  intervals  during  each  tidal 
test  to  provide  accurate,  updated  calibration  data  for  scaling  voltage 
(pressure)  data  to  tidal  elevations.  In  addition  to  collecting  tidal 
elevation  data,  tide  velocities  at  specific  model  locations  are  monitored 
by  using  miniature,  electromagnetic  current  meters.  Besides  controlling 
the  tide  generator  during  a tidal  test,  ADACS  acquires  the  above  tidal 
data  (elevations  and  velocities)  including  calibration  information  and 
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test  parameters  and  records  these  data  on  magnetic  tape  for  future 
analyses  and  permanent  storage* 

Data  analyses  include  scaling  and  editing  of  original  data,  least 
squares  harmonic  analyses  of  tidal  data  (elevation  and  velocity)  for 
amplitude  and  phase  of  various  tidal  constituents,  plots  of  original 
data  and  superimposed  harmonic  constituents,  and  analyses  of  residual 
variances.  Verification  of  hydraulic  tidal  inlet  models,  which  are 
geometrically  distorted  models  scaled  on  the  Froude  model  law,  requires 
artificially  simulating  in  the  model  the  frictional  effects  associated 
with  prototype  roughness  of  bottom  and  sides  of  channels  and  overbank 
and  marsh  land  roughness.  The  relative  phase  lags  of  major  tidal  constituents 
from  one  specific  location  to  another  in  the  prototype  and  the  measurement 
of  these  phase  lags  in  the  model  provide  a means  for  estimating  the 
amount  of  roughness  and  specific  model  areas  requiring  artificial  roughness 
in  order  to  achieve  model  verification. 

These  procedures  for  spatial  definition  of  roughness,  full  model 
verification,  and  model  testing  using  ADACS  and  associated  automation 
procedures  have  been  successfully  applied  in  hydraulically  modeling 
Murrells  Inlet,  South  Carolina.  The  required  time  for  model  verification 
and  testing,  as  well  as  data  analyses,  has  bben  significantly  reduced. 

In  addition,  the  quality  and  quantity  of  model  data  has  increased  with 
minimal  cost  increases.  Thus,  better  information  can  be  made  available 
on  which  to  make  sound  engineering  decisions  regarding  the  planning  and 
design  of  proposed  engineering  changes  in  tidal  inlets. 

1.  Introduction.  Over  the  past  decade,  automated  processing  techno- 
logy has  evolved  from  large,  expensive  computers  to  minicomputers  and 
microprocessors.  With  this  evolution,  the  physical  size  and  cost  of 
automated  processing  systems  have  greatly  decreased  with  a minimal 
decrease  in  system  capabilities.  Cost  reductions  for  such  systems  have 
resulted  in  economic  justification  of  the  use  of  minicomputer  and  micro- 
processors to  a specific  task  or  group  of  specific  operations;  whereas, 
large  computers  can  be  justified  economically  only  for  multiple  operations 
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and  tasks.  The  automation  of  physical,  hydraulic  modeling  techniques 
has  lagged  automation  efforts  in  many  other  fields  mainly  because  of 
cost  justification  and  the  requirements  of  highly  specialized  instrumentation 
(e.g.  sensors).  However,  needs  for  such  automation  have  existed  for  many 
years.  These  needs  are  the  result  of  requirements  for  (1)  real-time  model 
control  decisions,  (2)  quasi  real-time  data  analyses,  and  (3)  more 
accurate  and  reliable  model  data  for  engineering  and  environmental 
interest  studies. 

One  mission  of  the  Hydraulics  Laboratory  of  the  U.  S.  Army  Engineer 
Waterways  Experiment  Station  is  the  physical  modeling  of  hydraulic  prob- 
lems associated  with  the  activities  of  the  Corps  of  Engineers,  as  well  as 
other  government  and  private  agencies.  Hydraulic  problems  associated 
with  wave  phenomena  and  the  effects  of  these  phenomena  in  harbors,  tidal 
inlets,  and  along  the  open  coast  are  the  primary  modeling  interests  of 

the  Wave  Dynamics  Division  (WDD)  of  Hydraulics  Laboratory.  Over  the 

1 2 

last  three  years,  WDD  has  been  very  successful  in  automating  9 the  major 
aspects  of  its  physical  models  for  wave  and  tidal  inlet  studies.  Major 
automation  efforts  were  devoted  to  (1)  model  control,  (2)  model  data 
acquisition,  and  (3)  model  data  reduction  and  analyses.  The  subject  of 
this  paper  is  a description  of  the  automated  system,  which  has  been  given 
the  name  "Automated  Data  Acquisition  and  Control  System"  (whose  acronym  is 
ADACS),  for  wave  and  tidal  models  and  its  general  application  to  tidal 
inlet  model  studies, 

2,  SYSTEM  CONFIGURATION,  The  automated  system  for  wave  and  tidal 
inlet  models  has  two  primary  functions:  (1)  automated  acquisition  of 

wave  and  tide  data  in  a format  (magnetic  tape  or  disc)  compatible  for 
digital  reduction  and  analyses  and  (2)  automated  control  of  model  sensor 
calibration  and  of  the  wave  and  tide  generators.  The  design,  development, 
and  configuration  of  automated  systems  to  perform  these  functions  with 
particular  applications  to  wave  models  were  presented  in  Reference  1. 

System  configuration  (Fig,  1)  of  ADACS  consists  basically  of  the  following 
four  subsystems: 
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a.  Digital  data  recording  and  controls. 

b.  Analog  recorders  and  channel  selection  circuits, 

c.  Wave/ tide  sensors  and  interfacing  equipment. 

d.  Wave/ tide  generators  and  control  equipment. 

The  first  subsystem  is  basically  a 32  K,  16-bit  word  minicomputer 
with  I/O  and  storage  devices,  analog/digital  packages,  and  a timing 
package.  Details  of  this  subsystem  were  presented  by  Durham  and  Greer 
(1975).  The  analog  recording  subsystem  is  (1)  a backup  for  the  digital 
data  recording  subsystem  and  (2)  a visual  display  for  operator  in- 
spection of  analog  signals  from  model  sensors.  This  subsystem  has  manual/ 
automated  selection  and  control  of  five,  12-channel  oscillographs  and  a 
test  point  center  for  manually  monitoring  a selected  channel  as  to  system 
setup,  calibration,  and  signal  condition. 

The  model  sensor  subsystem  includes  instrumentation  for  both  wave 
and  tide  sensors*  Details  of  the  wave  sensor  subsystem  and  calibration 
procedures  were  presented  in  Reference  1,  The  wave  sensor  subsystem 
consists  basically  of  the  following  four  components: 

a.  Wave  height  sensors  and  stands. 

b.  Power  supplies  and  signal  conditioning  equipment, 

c.  Manual  and  automatic  calibration  equipment. 

The  sensor  subsystem  for  tidal  heights  includes  the  following  major 
components: 

a.  Bubble  tubes,  stands,  and  high  pressure  supply. 

b.  Scanivalve  with  manual  and  automated  controls. 

c.  Precision  pressure  transducer. 

d.  Power  supplies  and  signal  conditioning  equipment. 

The  last  subsystem  includes  controls  for  both  wave  and  tide  generators. 
Controls  for  both  mechanical  and  electrohydraulic  wave  generators  can 
be  provided  by  ADACS.  Start/stop  commands  are  available  for  mechanically 
gear-driven  wave  generators;  however,  controls  for  wave  period  and 
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amplitude  must  be  supplied  manually.  For  the  electrohydraulic  wave 
generator,  ABACS  provides  a programmable  analog  voltage  as  a command 
signal  to  the  servocontroller  of  the  electrohydraulic  actuators.  The 
wave  period  and  amplitude  for  these  wave  generators  are  controlled  by 
ADACS. 

The  command  signal  to  the  tide  generator  is  a programmable  analog 
voltage  which  is  supplied  by  ADACS  through  one  channel  of  the  digital  to 
analog  converter.  The  tide  generator  has  the  option  of  receiving  this 
command  signal  from  ADACS  or  accepting  an  analog  voltage  from  a pro- 
grammable cam  and  reference  potentiometer  arrangement.  This  latter 
control  scheme  is  used  as  a back-up  or  alternate  control  to  the  ADACS 
control  and  until  recently  has  been  the  primary  control  of  the  tide 
generator  prior  to  installation  of  ADACS.  In  addition  to  the  command 
signal,  the  tide  generator  has  four  other  major  components  which  are 
(a)  differential  amplifier  and  power  supply,  (b)  bubble  tube  positioner, 

(c)  hydraulic-pneumatic  amplifier,  and  (d)  hydraulic  cyclinder  and  flow- 
control  gate  assembly. 

Basically,  the  tide  in  a physical  model  is  generated  from  cyclic 
exchanging  by  controlled  flow  a predetermined  volume  of  water  between 
the  physical  model  and  a tidal  reservoir  (sump).  Figure  2 is  a schematic 
of  the  tide  generator  and  controls.  The  programmed  command  signal  causes 
a change  in  the  vertical  position  of  the  bubble  tube  relative  to  the 
water  level  in  the  model.  This  position  change  perturbs  the  equilibrium 
position  of  the  pneumatie-hydraul ic  amplifier  and  results  in  a differential 
hydraulic  pressure  applied  to  the  hydraulic  cyclinder  activating  the 
flow-control  gate.  The  movement  of  the  flow-control  gate  is  in  a direction 
to  correct  the  perturbed  equilibrium  condition  of  the  pneumatic- hydraulic 
amplifier  by  changing  the  water-surface  elevation  in  the  tide  model.  A 
feedback  circuit  from  the  hydraulic  cyclinder  to  the  differential 
amplifier/bubble  tube  positioner  provides  a "damping  effect"  to  prevent 
gate  overshoot  and  unstable  oscillations.  Thus,  any  tidal  constituent  or 
progressive  tide  can  be  used  as  the  forcing  function  for  the  tide 
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Figure  2.  TIDE  GENERATOR  AND  CONTROLS 
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model  by  programming  ADACS  to  produce  a command  signal  harmonically 
representing  the  appropriate  forcing  function. 

3.  TIDAL  HEIGHT  SENSORS.  Data,  which  are  acquired  by  ADACS  from 
tidal  inlet  models,  consist  of  time  histories  of  water  surface  variations 
relative  to  some  reference  water  level.  For  specified  tide  conditions 
at  the  generator,  tidal  elevations  are  collected  at  selected  locations 
within  the  tidal  model.  These  data  are  used  to  calculate  mean  tide 
levels,  tidal  ranges,  arrival  times  of  high  and  low  water,  and  the  phases 
and  amplitudes  of  specific  tidal  constituents.  Although  various  types 
of  tidal  height  sensors  are  used  by  the  Hydraulics  Laboratory,  a tide 
sensor  system  developed  and  implemented  within  the  last  year  for  use 
in  tidal  inlet  models  is  presented  in  this  paper.  For  lack  of  a better 
name,  this  sensor  subsystem  has  been  labeled  the  "bubbler  system."  This 
system  measures  small  hydrostatic  pressure  changes  associated  with  changes 
in  tidal  elevations  in  the  model  and  consists  of  a high  precision,  pressure 
transducer,  a scanivalve  device  for  sequencing  input  ports,  and  48  pressure 
inputs. 

To  employ  the  bubbler  system  (Fig,  3),  a small  plastic  tube  is 
inserted  some  small  distance  into  the  water.  The  outside  diameter  of 
this  bubble  tube  is  required  to  be  small  to  minimize  blockage  of  tidal 
flow,  etc.  The  tube  is  connected  through  a throttling  valve  to  a 
regulated  pressure  supply.  This  valve  or  restriction  serves  to  regulate 
air  flow  and  isolate  the  bubble  tube  from  other  bubble  tubes  and  the 
supply  pressure.  Each  bubble  tube  is  connected  to  a common  pressure 
transducer,  which  is  parallel  with  the  throttling  valve  and  high  pressure 
source,  of  suitable  pressure  range  to  accurately  detect  the  tube’s 
internal  pressure  changes  associated  with  changes  of  water  surface  elevation. 

To  allow  many  bubble  tubes  to  share  a common  pressure  transducer,  a 
multiplexing  device,  which  is  called  a scanivalve,  is  used*  This  device 
multiplexes  sequentially  many  pressure  inputs  to  a common  output  port. 

A high  precision  pressure  transducer  is  included  as  an  integral  part  of 
the  output  port.  A controller  is  used  to  advance  the  unidirectional 
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Figure  3*  BUBBLER  SYSTEM 


stepping  motor  which  increments  the  scanivalve,  By  contact  clsoures  or 
commands  from  ADACS,  the  valve  can  be  advanced  sequentially  to  any  input 
port  or  to  a "home"  (reference)  position  without  intermediate  stops. 

The  system  presently  used  by  the  Wave  Dynamics  Division  is  capable  of 
accepting  up  to  48  pressure  inputs  and  includes  a +0.25  psid  pressure 
transducer  with  a nonlinearity  and  hystersis  (best  straight  line)  of 
0.05  percent  full  scale.  The  pressure  transducer  output  is  an  analog 
voltage  of  +10  volts  full  scale.  The  pressure  cell  is  interfaced  through 
appropriate  signal  conditioning  equipment  to  the  analog  multiplexer  of 
the  digital  recording  subsystem.  The  valve  accepts  both  home  and  step 
commands  from  the  ADACS  and  has  a BCD  position  feedback  to  the  ADACS.  The 
system  can  be  completely  controlled  by  either  ADACS  or  manual  controls. 

To  install  the  bubbler  system  in  the  model,  the  water  level  in  the 
tidal  model  is  raised  to  mean  higher  high  water  (MHHW) . At  this  still 
water  level,  the  orifice  of  the  bubbler  is  inserted  into  the  water  to  a 
depth  which  is  slightly  greater  than  the  maximum  expected  tidal  range 
(hydrostatic  head).  At  this  elevation,  the  pressure  supply  must  be  set 
high  enough  to  cause  the  emission  of  air  bubbles  from  the  tube.  For 
these  conditions,  the  tube's  internal  pressure  is  equal  to  the  hydrostatic 
pressure  of  the  water  column  above  the  orifice  of  the  bubble  tube.  The 
tube  orifice  is  cut  diagonally  to  aid  in  the  bubble1 s escape.  It  is 
important  that  the  system  bubble  freely  at  this  depth  because  the  tube’s 
internal  pressure  ceases  to  be  equal  to  the  hydrostatic  head  with  bubble 
cessation-  At  this  point,  the  bubble  tubes  should  be  observed  over 
several  tidal  cycles  to  be  certain  there  is  a continuous  stream  of 
bubbles. 

The  bubbler  system  with  the  arrangement  of  bubble  tubes,  scanivalve, 
and  a high  precision  pressure  transducer  provides  a very  economical  system 
of  obtaining  precise  measurements  of  water  surface  elevations  at  a large 
number  of  locations  in  a tidal  model.  Tidal  elevation  measurements  by 
this  system  are  accurate  to  0.001  feet.  The  sampling  sequence  of  the 
scanivalve  is  rated  at  a maximum  of  10  samples  per  second.  This  method 
of  detecting  changes  of  water  surface  elevation  is  limited  only  by  the 
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three  constant  pressure  values  over  the  tidal  range.  These  values  are 
obtained  by  setting  a bubble  tube  at  each  of  the  following  three  tide 
levels:  mean  lower  low  water,  mean  tide  level,  and  mean  higher  high 

water.  The  three  bubble  tubes  are  positioned  to  these  levels  in  a 
stilling  basin  which  is  connected  to  the  tidal  model  by  a cut-off 
valve.  Prior  to  each  tidal  test,  the  water  level  in  the  model  and 
stilling  basin  are  raised  to  mean  higher  high  water.  The  stilling  basin 
is  then  isolated  from  the  model  by  closing  the  cut-off  valve.  Finally,  the 
three  bubble  tubes  are  adjusted  to  their  appropriate  water  depth. 

Throughout  the  tidal  test  these  bubble  tubes  are  monitored  at  every 
scan  to  provide  update  calibration  data.  During  data  analysis,  cali- 
bration information  can  be  updated  by  calculating  calibration  coefficients 
for  each  scan  or  any  multiple  of  scans. 

A limited  number  of  channels  of  tidal  velocity  can  be  measured 
by  miniature,  electromagnetic  current  meters  which  are  monitored  by 
ADACS.  The  collection  of  tidal  velocities  using  ADACS  has  not  been 
fully  implemented  at  this  time  and  is  pending  the  completion  of  trans- 
ducer evaluation  which  should  be  completed  within  the  next  year.  Until 
such  time,  the  majority  of  tidal  velocity  measurements  are  obtained 
manually  by  using  a modified  version  of  the  miniature  Price  meters. 

In  addition  to  tide  data,  many  tidal  inlet  studies  require  wave 
information  as  well.  The  generation  of  waves  and  collection  of  wave 
data  at  specific  tidal  phases  (normally  high,  low,  and  mean  tide  levels) 
are  provided  by  ADACS.  While  controlling  the  tide  generator  and  collecting 
tidal  data,  ADACS  uses  in-core  timers  to  determine  the  occurrence  of  specified 
tidal  phases  at  which  times  (1)  the  wave  generators  are  turned  on, 

(2)  wave  data  at  a specified  sampling  rate  for  a predetermined  number  of 
wave  periods  are  collected  at  various  locations  in  the  model,  (3)  the 
completion  of  wave  test  for  that  tidal  phase  is  detected,  (4)  the  wave 
generators  are  turned  off,  and  (5)  in-core  timers  initialized  to  determine 
the  next  specified  tidal  phase  for  wave  tests.  These  wave  tests  are 
performed  normally  during  the  middle  cycle  of  a three-cycle  tidal  test. 

The  instrumentation  and  procedure  for  collecting  wave  data  are  the  same 
as  described  in  Reference  1. 


234 


frequency  response  of  the  system  and  the  accuracy  of  the  pressure  trans- 
ducer. Application  of  this  system  to  tidal  models  has  resulted  in  large 
dollar  savings  when  5 or  more  locations  in  a model  are  instrumented. 

4.  DATA  ACQUISITION.  During  the  acquisition  mode,  tidal  data  for 
a programmed  tidal  condition  at  the  generator  are  collected  from  a 
specified  number  of  tide  sensors,  digitized,  and  recorded  on  magnetic 
tape  or  disc  for  further  analyses.  The  sampling  scheme  is  flexible 
and  can  be  tailored  for  different  applications  with  maximum  thru-put 
rates  theoretically  limited  by  the  multiplexing  rate  of  the  scanivalve. 
The  present  sampling  scheme  is  to  (a)  increment  the  scanivalve  to  the 
first  data  channel,  (b)  delay  a specified  time  interval  (normally  0,5 
sec)  to  allow  input  pressure  to  stabilize,  (c)  collect  a specified 
number  (normally  10)  of  samples,  (d)  average  these  voltage  samples, 

(e)  store  the  discrete  sample  in  memory,  (f)  increment  to  the  next 
channel,  (g)  repeat  the  above  procedure,  and  (h)  continue  sequentially 
through  remaining  channels.  For  each  tide  sensor,  100  discrete  voltage 
samples  are  collected  at  equally  spaced  intervals  over  each  tidal  cycle 
for  a predetermined  number  of  cycles  (normally  3 to  5) . The  minicom- 
puter calculates  from  input  parameters  (1)  the  required  timing  interval 
between  multiplexing  scans  of  the  scanivalve  to  provide  the  correct 
sampling  rate,  (2)  the  delay  interval  at  each  channel,  and  (3)  the 
number  of  voltage  samples  to  be  digitized  and  averaged  and  initializes 
counters  for  determining  completion  of  tidal  tests.  In  addition,  it 
provides  an  analog  command  signal  through  the  digital  to  analog  con- 
verter to  the  tide  generator  and  lags  the  beginning  of  data  acquisition 
by  a specified  number  of  tide  cycles  after  starting  the  generator. 

Due  to  thermal  effects  (zero  drift)  on  the  transducer  output  over  a 
tidal  test  of  2 to  3 hours  duration,  the  pressure  transducer  is  cali- 
brated prior  to  and  at  selected  time  intervals  during  each  tidal  test  to 
provide  accurate,  update  calibration  data  for  scaling  voltage  (pressure) 
to  tidal  elevations.  The  calibration  data  are  obtained  by  monitoring 
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At  completion  of  the  acquisition  mode,  the  calibration,  wave,  and 
tide  data  have  been  recorded  in  binary  form  on  magnetic  tape  or  disc. 
These  data  with  a header  for  test  identification  and  pertinent  param- 
eters are  available  from  disc  or  magnetic  tape  for  analyses. 

5.  DATA  ANALYSES,  Analyses  of  the  elevation  and  velocity  data 
from  tidal  model  are  performed  by  either  the  minicomputer  subsystem  or 
a Honeywell  G635  of  the  Automated  Data  Processing  Center  at  WES. 
Schematically,  the  automated  procedures  for  analyzing  tidal  data  are 
as  follow: 

I.  Program  Initialization 

(1)  Input  test  parameters  and  option  flags. 

(2)  Read  and  decode  data  tape  or  disc  file. 

(3)  Demultiplex  data  files  and  scale  data. 

II.  Tidal  Data  Analyses 

(1)  Harmonic  analysis  using  Least  Squares  techniques. 

(a)  Amplitude  and  phases  of  tidal  constituents. 

(b)  Relative  phases  between  gages* 

(23  Analyses  of  residual  variances. 

(a)  Original  versus  Least  Square  estimate. 

(b)  Prototype  tide  versus  model  tide, 

(c)  Model  base  test  versus  model  plans. 

(3)  Graphic  output  of  above  results. 

In  addition  to  the  above  automated  procedures,  manual  and  photo- 
graphic techniques  are  employed  in  tidal  models  to  study  general  patterns 
of  tidal,  circulation  and  to  define  qualitatively  littoral  transport 
and  deposition  patterns. 

The  analyses  of  data  from  wave  models  are  presented  by  Durham  and 
Greer  (1975)  and  are  basically  auto-spectral  and  cross-spectral  analyses, 
statistical  analyses  for  wave  heights  and  periods  of  wave  signals  at 
selected  locations  throughout  the  model,  and  computation  of  response 
functions  or  amplification  factors  from  wave  energy  within  the  harbor 
or  tidal  inlet  relative  to  incoming  wave  energy.  In  addition  to 
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analyzing  data  from  wave  models,  these  procedures  are  used  also  in 
analyzing  wave  data  which  are  generated  and  acquired  (refer  to  previous 
section)  at  selected  tidal  phases  and/or  tidal  ranges  during  specific 
tide/wave  tests  in  the  tidal  model, 

6,  MODEL  APPLICATION.  The  hydraulic  tidal  model  is  used  as  an 
engineering  tool  in  predicting  the  effects  of  proposed  engineering  changes 
(channel  dredging,  inlet  geometry  changes,  and  construction  of  coastal 
structures)  and  in  planning  and  designing  the  proposed  changes  in  a 
cost  effective  and  environmentally  compatible  manner.  For  physical 
models  of  tidal  inlets  to  be  most  effective  as  a predictive  tool, 
verification  of  such  models  is  desirable.  Verification  of  hydraulic 
tidal  inlet  models  requires  the  hydraulic  model  to  reproduce  observed 
prototype  conditions  of  tidal  elevations,  tidal  phases,  and  average 
mass  distributions  at  specified  locations  and/or  cross  sections  in 
the  model.  Hydraulic  tidal  inlet  models  are  scaled  on  the  Froude  model 
law  and  are  usually  geometrically  distorted  models  in  which  the  hori- 
zontal length  scale  is  not  the  same  as  the  vertical  length  scale.  Such 
modeling  procedures  require  artifically  simulating  in  the  model  the 
frictional  effects  associated  with  prototype  roughness  of  bottom  and 
sides  of  channels  and  overbank  and  marshland  roughness. 

Various  procedures  are  available  for  inducing  artifically  the  frictional 
effects  3,4  in  the  hydraulic  tidal  model.  One  such  procedure  is  the  use 
of  roughness  (drag)  elements  which  are  small  strips  of  metal  that  are 
attached  vertically  to  the  model  bottom.  The  practice  of  using  such 
roughness  strips  for  simulating  frictional  effects  in  tidal  inlet  models 
is  a standard  and  accepted  procedure  in  physical  modeling.  The 
successful  application  (required  number  and  horizontal  distribution  of 
roughness  strips)  of  this  procedure  most  frequently  depends  upon  the 
modeling  experience  of  the  hydraulic  engineer  and  requires  lengthy 
testing  programs.  A means  of  estimating  theoretically  and/or  empirically 
the  required  number  of  roughness  elements  and  their  horizontal  distribution 
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is  a needed  capability  in  physical  modeling.  One  method  of  estimating 
the  uniform  horizontal  distribution  (number  of  elements  per  square  foot) 
of  roughness  elements  in  a distorted  hydraulic  model  has  been  proposed 
by  Multer.5  Although  this  procedure  is  based  on  one-dimensional  tidal 
flow,  it  warrants  additional  laboratory  investigation  and  study.  One 
desirable  refinement  is  to  be  able  to  define  the  variation  in  the  horizontal 
distribution  of  roughness  elements  from  an  idealized  uniform  horizontal 
distribution.  The  relative  phase  of  the  tide  at  different  locations 
along  a tidal  channel  relative  to  a reference  location  in  the  channel 
can  be  used  as  one  parameter  to  estimate  the  horizontal  distribution 
of  roughness  elements.  Having  placed  a uniform  horizontal  distribution 
of  roughness  elements  in  a tidal  hydraulic  model,  comparison  of  the 
model  and  prototype  relative  phases  of  a dominant  tidal  constituent 
between  two  locations  along  a specific  channel  can  provide  information 
as  to  the  change  in  the  amount  of  model  roughness  required  to  reproduce 
correctly  the  prototype  tidal  phase  along  the  specific  reach  of  channel 
between  the  two  chosen  points.  Thus,  the  relative  phase  lags  or  leads 
of  major  tidal  constituents  between  selected  locations  throughout  the 
tidal  inlet  can  provide  similar  information  for  the  various  channel 
reaches  in  the  model.  If  sufficient  prototype  data  (amplitudes  and  phases 
of  various  tidal  constituents)  are  available,  much  information  and 
guidance  as  to  the  horizontal  distribution  of  roughness  elements  in 
the  tidal  hydraulic  model  can  be  obtained  from  the  above  procedure. 

These  procedures  for  defining  the  horizontal  distribution  of  model 
roughness,  full  model  verification,  and  model  testing  using  ADACS  and 
associated  automation  procedures  were  successfully  applied  in  hydraulically 
modeling  Murrells  Inlet,  South  Carolina.  A fixed-bed  tidal  model  study 
was  conducted  to  estimate  the  effects  of  proposed  engineering  changes 
in  the  mouth  of  the  inlet  on  the  tide  and  wave  regime  in  the  inlet. 

The  prototype  area  of  interest  was  approximately  1.5  miles  inland  and 
6.5  miles  along  shore.  A distorted  physical  model  with  an  area  of 
approximately  21,800  sq  ft  was  scaled  at  1 to  60  vertically  and 
1 to  100  horizontally  and  constructed  with  molded  topography  reproduced 
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to  -25  ft  offshore  and  +10  ft  on  land.  Good  prototype  data  was  available 
from  National  Ocean  Survey  of  NOAA  for  8 tide  gages  within  the  tidal 
inlet.  Figure  4 is  a schematic  of  the  model  and  the  locations  of  the 
various  tide  gages.  The  tidal  regime  of  this  inlet  is  dominated  by 
the  principal  lunar  semidiurnal  constituent,  M2  , whose  variance  repre- 
sents over  85  percent  of  the  tidal  variance  in  the  inlet. 

In  the  verification  of  Murrells  Inlet  model,  the  major  tidal  con- 
stituent, , and  its  overtides,  , M&  , and  Mg  were  used  in  the 
initial  verification  tests.  After  estimating  and  placing  a quasi- 
uniform distribution  of  roughness  strips  in  the  model,  the  tide  generated 
in  the  model  was  composed  of  the  , and  Mg  tidal  con- 

stituents. The  tidal  coefficients  (amplitudes  and  phases)  for  these  and 
20  other  tidal  constituents  were  calculated  by  National  Ocean  Survey 
from  prototype  data  collected  at  the  7 tidal  gages  within  Murrells  Inlet 
and  one  tide  gage  just  outside  the  mouth  of  the  inlet.  The  instantaneous 
height  (elevation)  of  the  prototype  tide  at  a specific  location  in  the 
inlet  can  be  represented  by  the  following  equation. 

TIDAL  HEIGHT: 

N 

h(t)  - H + l f.H.  COS  [a.t  + (V  + u) * - tc.  ] 

J o .,11  Li  ^ o 'i  iJ 
1=1 

WHERE 

h = Tidal  height  at  time  t^  . 

H - Mean  height  above  reference  datum. 

Q th 

H^  = Mean  amplitude  of  i constituent. 

f.  = Factor  to  reduce  mean  amplitude  to  year  of  prediction 
= Angular  speed  of  i constituent. 

t = Time  reckoned  from  some  initial  epoch. 

(VQ+u)^  = Equilibrium  argument  of  i ^ constituent  for  t=0, 

- Local  epoch  of  i constituent. 

N = Total  number  of  constituents. 

Harmonic  analysis  performed  by  National  Ocean  Survey  on  prototype  data 
provide  , H^,  and  for  each  tidal  gage.  The  other  coefficients, 
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, and  (Vq+u)^  , can  be  obtained  from  appropriate  tables.  The 
proceeding  equation  can  be  rewritten  in  the  following  form: 

N 

h(t)  = H + E Ai  COS  (u>.t  + 4..) 
i=l 

WHERE 

h - Tidal  height  at  time  t. 

H = Mean  height  above  reference  datum 

^ t Jl 

A,  = f.H.  = Amplitude  of  i constituent. 

11  tli 

- (V  +u)  - - k • = Phase  of  i constituent. 

This  form  of  the  instantaneous  tidal  height  is  used  in  model  control 
and  data  analyses  of  the  hydraulic  tidal  model.  For  Murrells  Inlet, 
the  harmonic  function  composed  of  the  , M^,  and  Mg  tidal 

constituents  at  Gaga  8 was  used  as  a command  signal  to  the  tide 
generator  in  producing  the  model  tide  forcing  function  for  the  initial 
verification  tests. 

At  each  of  the  gage  locations,  the  instantaneous  tidal  heights  were 
recorded  simultaneously  for  several  tidal  cycles.  The  instantaneous 
tidal  height,  h^(t),  at  each  gage  location  in  the  model  can  be  represented 
as  follows: 

, N 

hMCt)  = * £(t)  5 ao  + S [ai  C0Sfwit)  + bi  SIN  (V)]  + e(t) 

i~l 

where  h^(t)  is  the  calculated  tidal  height,  which  is  represented  by 
a harmonic  series  of  known  frequencies,  and  c(t)  is  noise  associated 
with  the  transfer  function  of  the  hydraulic  model,  etc.  in  the  tidal 
record.  With  an  undetermined  noise  level  in  the  tidal  height  record, 
the  principal  of  least  squares  can  be  used  to  solve  for  the  unknown 
coefficients  (amplitudes  and  phases)  for  the  , and  Mg 

tidal  constituents  by  minimizing  the  variance  or  the  sum  of  the  squared 
differences  between  the  measured  model  tidal  height  and  the  assumed 
form  of  the  model  tidal  height*  The  application  of  the  principal  of 
least  squares  is  as  follows: 
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E 


[hM(t)  - hM(t)]2dt  MINIMAL 


T = RECORD  LENGTH 


REQUIRES 


_3E 

3a 

0 


. a.  9E  * AM  9E 
3a.  = 0;  AND  3b. 

l l 


= 0 


Thus,  have  2N+1  normal  equations  for  aQ,  a^,  and  where  i=l,  . *.,N 
[C]  [A]  = [F] 

C = Real,  symmetric  matrix  of  sine  and  cosine  products 
A - Vector  of  coefficients 

P = Vector  of  products  of  observed  signal  and  sine  or  cosine  terms 
N - Total  number  of  constituents 


The  above  set  of  equations  can  be  solved  by  routine  procedures  of 
matrix  inversion.  With  the  model  data  in  digital  form  from  ADACS,  this 
method  of  analysis  is  very  easily  performed  by  ADACS* 

Having  obtained  the  harmonic  coefficients  for  the  model  tidal 
height  at  each  tide  gage,  the  relative  phases  of  the  tidal  constituents 
at  each  gage  relative  to  the  tidal  gage  (Gage  #8)  at  the  inlet  mouth 
and  the  differences  in  these  relative  phases  in  the  model  and  the  proto- 
type can  be  determined.  Tables  1 and  2 give  the  amplitudes  and  model 
to  prototype  differences  in  relative  phases  of  the  tidal  constituent 

for  model  tests  #1,  #36,  and  #73*  In  addition,  the  relative  phases  between 
any  two  tidal  gages  can  be  obtained,  and  the  difference  of  these  relative 
phases  in  prototype  and  model  can  be  calculated.  Table  3 gives  the 
model  to  prototype  differences  of  the  relative  phases  of  the  tidal 
constituent  for  appropriate  gages  for  model  tests  #1,  #36,  and  #73. 

In  model  test  #1,  quasi-uniform  horizontal  distribution  of  roughness 
elements  was  placed  in  the  hydraulic  model  of  Murrells  Inlet.  From 
Tables  2 and  3,  the  channel  reaches  (model  area  between  two  gages) 
requiring  more  or  less  roughness  elements  can  be  determined  by  considering 
the  magnitude  and  sign  of  the  model  to  prototype  differences  in  relative 
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Table  1 


112  CONSTITUENT  TIDAL  AMPLITUDES  (FT) 


RUN  NO.  1 RUN  NO.  36  RUN  NO.  73 


STATION 

PROTOTYPE 

MODEL 

difference 

MODEL 

DIFFERENCE 

MODEL 

DIFFERENCE 

1 

L768 

2.150 

+ 0.362 

1.804 

+ 0,016 

1.716 

-0,072 

2 

1.834 

2.140 

+ 0.306 

1.922 

+0.088 

1.789 

-0.045 

3 

1.866 

2.070 

+ 0.204 

1.916 

+ 0.050 

1.797 

-0.069 

4 

1.919 

2.260 

+ 0.341 

1.974 

+ 0.055 

1.878 

-0.041 

5 

1.885 

2.220 

+0.335 

1.957 

+ 0.072 

1.838 

-0.047 

6 

1.936 

2.270 

+ 0,334 

1.986 

+ 0.050 

1.872 

-0.064 

7 

1.865 

1.950 

+0.085 

1.885 

+ 0.020 

1.819 

-0.066 

8 

2.402 

2.430 

+ 0.028 

2.412 

+ 0.010 

2.396 

-0.006 

STATION 

TO 

STATION 

Table 

M2  CONSTITUENT  PHASE 
RUN  NO.  1 

2 

DIFFERENCES  (DEG*) 
RUN  NO.  36 

RUN  NO.  73 

PROTOTYPE 

MODEL 

DIFFERENCE 

MODEL 

DIFFERENCE 

MODEL 

DIFFERENCE 

8-1 

48.23 

30.50 

-17.73 

45,14 

-3.09 

49.24 

+ 1,01 

8-2 

33,40 

26.50 

- 6.90 

32,20 

-1.20 

34.20 

+0.80 

8-3 

19.52 

13.60 

- 5.72 

19.20 

-0.32 

20,20 

*0.68 

8-4 

20,69 

13.80 

- 6,89 

21.40 

+ 0.71 

20.30 

-0.39 

8-5 

32.66 

23.90 

- 8.76 

32.30 

-0.36 

33.10 

+ 0.44 

8-6 

33.23 

21.20 

-12.03 

32.00 

-1.23 

32.50 

-0.73 

8-7 

47.09 

41.00 

- 6.09 

44.46 

-2.63 

46.80 

-0.29 

• 1 DEGREE  - 2.07  MINUTES  OF  PROTOTYPE  TIME 
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Table  3 


M2  CONSTITUENT  PHASE  DIFFERENCES  (DEG*) 


STATION 

PROTOTYPE 

RUN  NO.  1 

RUN  NO.  36 

RUN  NO.  73 

TO 

STATION 

MODEL 

DIFFERENCE 

MODEL 

DIFFERENCE 

MODEL 

DIFFERENCE 

2-1 

14.83 

4.60 

-10.23 

12.94 

-1.89 

15.04 

+ 0.21 

3-1 

28.71 

17.30 

-11.41 

25.94 

-2.77 

29.04 

+ 0.33 

3-2 

13.88 

12.70 

- 1.18 

13.00 

-0.88 

14.00 

+ 0.12 

4-5 

11.98 

10.10 

- 1.88 

10.90 

-1.08 

12.80 

+ 0.82 

4-6 

12.54 

7.40 

- 5.14 

10.60 

-1.94 

12.20 

-0.34 

4-7 

26.38 

27.20 

t 0.82 

23.06 

-3.32 

26.50 

+ 0.12 

5-6 

0.57 

-2.70 

- 3.27 

-0.30 

-0.87 

-0.60 

-1.17 

6-7 

13.84 

19.70 

+ 5.86 

12.46 

-1.38 

14.30 

+ 0.46 

* 1 DEGREE  = 2.07  MINUTES  OF  PROTOTYPE  TIME 


Table  4 


M2  CONSTITUENT  MEAN  TIDE  LEVELS 
(FEET  ABOVE  MLW) 


RUN  NO.  1 

RUN  NO.  36 

RUN  NO.  73 

STATION 

PROTOTYPE 

MODEL 

DIFFERENCE 

MODEL 

DIFFERENCE 

MODEL 

DIFFERENCE 

1 

2.668 

2.833 

+ 0.165 

2,499 

-0.169 

2.734 

+ 0.046 

2 

2.674 

2.831 

+ 0.157 

2.427 

-0.247 

2.695 

+ 0,020 

3 

2.696 

2.632 

-0.062 

2.249 

-0.447 

2.546 

-0.150 

4 

2.715 

2.733 

+ 0.018 

2.380 

-0.335 

2.663 

-0.052 

5 

2.696 

2.794 

i 0.098 

2.423 

-0.273 

2.682 

-0.014 

6 

2.680 

2.755 

+ 0.075 

2,387 

-0.293 

2.655 

-0.025 

7 

2.676 

3.012 

+ 0.336 

2.523 

-0.153 

2.762 

+ 0,086 

8 

2.344 

2.557 

+ 0.213 

2.128 

-0.216 

2.365 

+ 0.021 
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phases  between  specific  gages.  In  addition  to  relative  phase  differences, 
the  tidal  amplitudes  (Table  1)  and  mean  tide  levels  (Table  4)  must  be 
considered  in  adjusting  the  model  forcing  function  and  the  model  rough- 
ness during  verification.  After  model  test  #73,  the  hydraulic  model  is 
considered  verified  for  the  tidal  constituent  and  its  overtides. 

Tidal  amplitudes  and  mean  tidal  levels  in  the  model  agree  to  within 
0.1  feet  of  prototype  measurements  and  relative  phase  differences  have 
been  reduced  to  one  degree  or  less.  Figures  5 and  6 show  plots  of 
model  and  prototype  tidal  heights  before  and  after  model  verification 

at  Gages  8 and  2,  respectively.  These  plots  show  that  corrections  for 
mean  tide  levels,  tidal  ranges,  and  tidal  phases  were  performed  during 
verification  in  getting  the  hydraulic  tidal  model  to  reproduce  prototype 
conditions. 

The  above  verification  procedure  is  repeated  using  a progressive 
tide  as  the  model  tidal  forcing  function.  Since  the  M2  tidal  constituent 
represents  more  than  85  percent  of  the  tidal  variance  in  Murrells  Inlet, 
minimum  adjustments  to  the  model  roughness  are  required  for  model  veri- 
fication using  a progressive  tide.  The  harmonic  analyses  of  a progressive 
tide  requires  a much  longer  tide  record  (approximately  15  days)  than 
the  harmonic  analyses  of  the  tidal  constituent  and  its  overtides. 

Thus,  verifying  initially  with  the  tide  (dominant  tidal  constituent) 

reduces  the  total  time  required  for  model  verification.  In  addition, 
using  the  relative  phase  differences  to  isolate  model  areas  requiring 
changes  in  roughness  elements  has  decreased  the  total  time  required  to 
verify  a hydraulic  tidal  model. 

Velocity  verification  can  be  performed  in  the  hydraulic  model  using 
the  same  procedure  as  presented  above  for  the  tidal  heights.  Frequently, 
insufficient  data  exist  for  the  tidal  velocity  regime  to  be  analyzed 
for  the  various  tidal  constituents.  In  most  cases,  twenty-four  hours 
of  tidal  velocity  data  exist  for  appropriate  cross  sections  in  the  various 
channels  of  the  tidal  inlet.  For  such  cases,  a progressive  tide  can  be 
generated  in  the  model  after  tidal  height  verification.  This  progressive 
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BEFORE  VERIFICATION 


MURRELLS  INLET  RUN  NUMBER  1 GAGE  NUMBER  M“08 


AFTER  VERIFICATION 

MURRELLS  INLET  RUN  NUMBER  73  GAGE  NUMBER  M-08 
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BEFORE  VERIFICATION 

MURRELLS  INLET  RUN  NUMBER  1 GAGE  NUMBER  M-02 


T - 1 ' I 1 1 1 1 J 1 i 

0.00  5.00  10.00  15.00  20.00  25.00  30.00  35.00  MG. m 


TIME  PROTOTYPE  (HRS) 


AFTER  VERIFICATION 

MURRELLS  INLET  RUN  NUMBER  73  GAGE  NUMBER  M-02 


0.00  5.00  10.00  15.00  20.00  25 . 00  30.00  35.0G  40.00 

TIME  PROTOTYPE  (HRS) 


Figure  6.  Tidal  Elevation  in  Back  Reach  of  Estuary. 


tide  can  be  made  to  correspond  to  the  appropriate  progressive  tide  for 
the  particular  twenty-four  hours  of  interest.  Then,  tidal  velocity 
measurements  and  required  model  adjustments  can  be  performed  to  obtain 
model  reproduction  of  prototype  tidal  velocity. 

The  first  application  of  the  verification  procedure,  which  is 
presented  in  this  paper,  to  a hydraulic  tide  model  of  Murrells  Inlet, 

South  Carolina,  was  quite  successful  and  demonstrated  that  a reduction 
of  required  time  for  tidal  model  verification  can  be  obtained  by  quali- 
tative procedures  of  defining  required  model  roughness  and  model  areas 
requiring  such  roughness.  The  present  procedure  with  its  complex  method 
of  analyses  requires  the  services  of  an  ADACS  for  practical  application 
within  the  time  frame  of  typical  hydraulic  model  studies.  Although 
the  methodology  presented  in  this  paper  has  shown  much  promise,  additional 
model  applications  and  laboratory  experiments  are  needed  to  refine 
the  application  of  the  technique  and  to  provide  improved  procedures 
for  quantitative  estimates  of  the  horizontal  distribution  of  model 
roughness  in  distorted,  hydraulic  tidal  models. 

7.  SUMMARY.  Application  of  ADACS  to  physical  modeling  techniques 
for  tidal  inlet  studies  has  (1)  reduced  the  required  time  for  model 
verification  and  testing  with  a related  cost  reduction  in  model  studies, 

(2)  increased  the  quality  and  quantity  of  model  data,  and  (3)  allowed 
more  sophisticated  procedures  for  model  control  and  data  analyses  with 
an  improvement  in  information  from  model  tests  on  which  can  be  based 
engineering  decisions  regarding  the  planning  and  design  of  proposed 
channel  dredging,  inlet  geometry  changes,  and  the  height,  length, 
alignment,  and  orientation  of  coastal  structures  (e.g.,  jetties  and 
breakwaters).  Initial  efforts  in  attempting  to  quantify  the  artificial 
simulation  of  prototype  frictional  effects  in  geometrically  distorted 
hydraulic  tidal  models  have  shown  much  promise  in  the  first  application 
of  the  relative  phase  difference  procedure,  reported  in  this  paper,  to 
the  tidal  model  of  Murrells  Inlet,  South  Carolina.  Additional  refine- 
ments to  this  procedure  are  required,  and  future  model  applications  and/or 
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laboratory  studies  are  desirable  to  define  such  improvements*  The 
automation  efforts,  described  in  this  paper,  of  physical  hydraulic  models 
for  wave  and  tidal  inlet  studies  has  been  highly  successful  in  improving 
modeling  techniques,  enhancing  modeling  capabilities,  and  increasing 
the  efficiency  of  such  procedures  through  time  and  cost  savings. 

Proposed  future  efforts  in  model  automation  at  WES  include  improved 
sensors,  spectral  wave  generation,  and  an  expansion  of  these  automated 
capabilities  to  other  model  facilities. 
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OPTIMAL  INSTRUMENTATION  PLANNING  USING  AN  LDLT  FACTORIZATION 


W i 1 1 i rtiTi  S . A g a c , R ob evt  H . Tu rner,  a nd  J o r ry  L * M ever 
An  a ly  s :i.  s and  Compu  L at;  i on  I)  i v .1  s i.  on 
H at  i on  a 1 Ran  go  Op  c r t i on  s D i re  c t orate 
US  Army  White  Sands  Missile  Range 
White  Sanaa  Missile  Range,  II M 8S002 

ABSTRACT . Optimal  instrumentation  planning  procedures  have  been  develop- 
ed at  White  Sands  Missile  Range  (WSMR)  for  dovap , radar,  and  cinetheodo.il  te 
instrumentation  systems*  These  procedures  use  an  Instrumentation  Plan  (IV) 
improvement  algorithm.  The  IP  improvement  algorithm  was  originally  based  on 
an  update  procedure  for  the  IP  covariance  matrix.  The  large  number  of  avail- 
able instrumentation  sites  require  a great  amount  of  CPU  time  and  storage 
using  this  update  procedure*  in  addition,  numerical  instabilities  are  some- 
times encountered.  Recently,  we  have  redeveloped  the  IP  improvement  algorithm 
based  on  updating  the  IP  in format ion  matrix.  The  special  structure  of  tie 
information  matrix  and  use  of  an  LDL/i  factorization  of  the  information  matrix 
have  reduced  the  CPU  time  by  a factor  of  three,  greatly  reduced  the  storage 
requirements  and  eliminated  the  numerical  instabilities. 

1 , 1 NT RO DUCT  TON . Trajectory  estimation  is  a fundamental  part  of  the 

mission  at  White  Sands  Missile  Range.  In  order  to  do  a good  job  of  trajectory 
estimation,  it  is  absolutely  essential  to  have  a good  instrumentation  plan  (IP) 
The  primary  instrumentation  systems  at  WSMR  are  radar,  dovap,  and  cinetheobo- 
11 to.  For  each  of  these  systems  the  instrumentation  planning  problem  can  be 
stated  informally  as:  given  a nominal  trajectory  of  flight  path  select' the 

instrument  sites  from  among  those  available  for  the  instrumentation  system  so 
as  to  meet  the  range  users  data  requirements.  The  range  users  data  require- 
ments are  usually  stated  in  terms  of  the  precision  or  accuracy  of  the  trajec- 
tory estimates.  In  the  past  this  instrumentation  planning  problem  has  been 
handled  by  the  use  of  various  ad  hoc  techniques.  The  resulting  instrumentation 
plans  were  usually  adequate  to  meet  the  range  users  requirements,  but  they  were 
often  wasteful  in  the  sense  that  an  equally  good  instrumentation  plan  c.ouid 
have  been  developed  using  fewer  instrument  sites. 

Recently  we  have  developed  some  techniques  which  formalize  the  instrumen- 
tation planning  problem  for  the  three  primary  instrumentation  systems.  Our 
techniques  solve  the  following  problem.  Given  a set  of  N time  points  t-; 
en t i r c ly _ covering  the  nominal  flight  path  and  the  corresponding  position 
vectors  xj. , i=l,  N to  the  flight  path  select  a set  of  M instrument  sites  which 
minimize 


N 

C = V w?trcov(x<)  v.\>0 
i=l  1 

estl m a t « a ■ w h i c h 
processed.  The 
trajectory 


The  quantity  C may  be  interpreted  as  the  weighted  sum  of  error 
would  re.su. 'Lr  if  measurements  from  the  nominal  flight  p:Uh  were 
wp  arc  a set  of  weights  used  to  attach  more  importance  to  some 
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points  than  to  other's,  cov(xp)  is  the  3>'3  covariance  matrix  of  the  errors  in 
the  estimate  of  the  trajectory  position  vector  . 

The  number  of  surveyed  instrument  sites  for  each  of  the  instrumentation 
systems  is  approximately:  (a)  radar  - 15,  (b)  dovap  - 650,  and  (c)  cinetheo- 

dolite  - 230.  It  is  obvious  that  the  selection  of  M sites  from  the  large 
number  of  available  sites  would  usually  be  computationally  prohibitive  if  a 
procedure  of  enumeration  and  examination  of  all  possible  combinations  of  M 
sites  is  used  to  achieve  a global  minimum  of  C.  Rather  than  pursuing  a global 
minimum,  we  have  satisfied  ourselves  with  obtaining  a local  minimum  of  C 
through  the  use  of  an  instrumentation  plan  improvement  algorithm. 

Instrumentation  Plan  Improvement  Algorithm 

a.  Given  an  arbitrary  initial  IP  having  M instruments  construct  a 
modified  IP  having  M+l  instruments  by  adding  the  instrument  site  from  those 
available  in  an  instrumentation  planning  pool  (IPP)  which  results  in  the  great- 
est decrease  of  C . 

b.  Delete  the  instrument  site  from  the  modified  IP  which  results  in 
the  smallest  increase  of  0. 

c.  Repeat  the  exchange  procedure  between  the  IPP  and  the  IP  given 
in  steps  a.  and  b.  until  no  farther  improvement  is  possible. 

The  minimum  achieved  by  the  IP  improvement  algorithm  is  local  in  the 
sense  that  it  is  dependent  on  the  initial  IP  with  which  the  algorithm  started. 

The  IPP  is  a set  of  instrument  sites  considered  feasible  to  use  for  the 
geometry  of  the  given  flight  path.  The  IPP  is  obtained  from  the  set  of  all 
existing  sites  by  placing  some  basic  constraints  on  site  selection.  The  con- 
siderations  which  go  into  these  basic  constraints  are : 

( 1 ) Dovap : 

(a)  Reference  signal  strength  available  at  the  receiver  sites, 

(b)  Antenna  nulls  at  low  and  high  elevation  angles. 

(2)  Radar:  Ground  clutter  at  low  elevation  angles. 

(3)  Cinetheodolite : 

(a)  Minimum  image  size  readable  on  film, 

(b)  Maximum  tracking  rates. 

(c)  Sun  angle. 

(d)  Flight  safety  evacuation  area* 
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The  effect  of  the  addition  and  deletion  of  instrument  sites  on  the 
objective  function  C required  in  steps  a.  and  b*  of  the  IP  improvement  algo- 
rithm requires  a gr-eat  deal  of  computation  even  though  the  number  of  sites  to 
be  considered  has  been  reduced  through  the  use  of  basic  constraints  in  forming' 
IIP.  Each  time  a station  is  added  or  deleted  the  entire  3NX3N  IP  covariance 
matrix  must  be  updated.  In  a previous  approach  to  updating  the  IP  covariance 
matrix  a direct  update  was  used.  The  use  of  this  direct  update  required  ex- 
cessive computation  time.  Also,  severe  numerical  instability  was  encountered 
when  using  this  update  for  radar  site  selection.  Because  of  these  problems 
the  direct  update  was  discarded  in  favor  of  a method  which  updates  the  IP 
information  matrix.  In  all  cases  the  IP  information  matrix  hEis  a special, 
sparse  structure  therefore  requiring  much  less  computation  for  an  update.  The 
numerical  instabilities  are  also  eliminated  using  this  update  procedure. 


The  remainder  of  the  paper  is  devoted  to  describing  the  implementation  of 
the  IP  improvement  algorithm  using  the  information  matrix  update.  First, 
measurement  models  for  the  instrumentation  systems  will  be  described.  Follow- 
ing this,  the  batch  processors  used  to  estimate  the  trajectory  and  their 
information  and  covariance  matrices  will  be  briefly  described.  Finally,  the 
numerical  procedures  for  updating  the  information  matrices  will  be  given. 

2*  • MEASUREMENT  MODELS . Each  dovap  receiver  measures  a loop  range  change 
between  successive  trajectory  points.  Thus,  the  measurement  function  for  the 
ath  receiver  is  a function  of  the  position  vector  xp  and  xf+i  to  the  trajectory. 
Denote  the  measurement  function  by  ga(xi>xi+l)*  The  dovap  observation,  for  the 
ith  trajectory  interval,  denoted  by  Zu(ti,ti+i)  is 


WW  = e0(*i>*i+i)  + VW 


(1) 


where  na(ti+j_)  is  a zero  mean  measurement  noise  term  having  variance  Ra(i+1). 


Let  ga(xp)  be  a p -vector  of  measurement  functions  for  the  a^h  radar  or 
cine  site  at  the  i^h  trajectory  point.  Then  the  p-vector  of  radar  or  cine 
observations  at  the  ith  trajectory  time  is 


W 


s«(b)  + ba  + W 


(2) 


where  ba  :i.s  p-vector  of  zero-set  measurement  biases  for  the  ath  site  and 
na(t^)  is  a-  zero-mean  measurement  noise  vector  having  diagonal  covariance 
Ra(i).  For  cine  measurements  p=2  and  the  components  of  ga(-)  are  azimuth  and 
elevation  angles.  For  radar  p=3  and  the  components  of  ga(-)  are  range,  azimuth, 
and  elevation. 


3 • DOVAP  BATCH  PROCESSOR  AND  INFORMATION. MATRIX  UPDATE, 
process  or  m in imi zes 


The  dovap  batch 


Q(x. ,x0, 


N 

V - l 


M 

I 


i-2  a-i  (Za<i-l»ti)  " (3) 


The  information  matrix  of  this  batch  processor  is  a 3N*3N  block  tridiagonal 
matrix 
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where  the  Af's  and  Afj's  are  3^3  and  are  functions  of  the  trajectory  position 
vectors,  the_measurement  noise  variances,  and  the  partial  derivative  vectors 
Gol(xi)=3ga(x.i,xi+i)/3xi  and  Ga2(xi+l)=3ga(xi  >xi+l)/3xi+l.  The  information 
matrix  A is  factored  as  A=LDLT  where  L is  unit  lower  triangular  and  D is  diag- 
onal. The  special  structure  of  A allows  L to  be  chosen  as 


\ 


\ 

\ ^ 

L\ 

N,N-1 


where  the  Lj_  are  3*3  unit  lower  triangular  and  the  L^j  are  3X3.  The  matrix  D 
is  partitioned  into  3*3  diagonal  blocks  Dj_.  The  Li,  Lij  , and  Di  are  computed 
from 


Wi  = ai 


T T 

L.D.L*  = A.  - L.  . .D.LT  . , 
l i i i i5i-l  i i,i-l 


L.D.lT  . = A.  . 

1 1 1+1,1  1,1+1 


(6a) 

(6b) 

(6c) 


Adding  or  deleting  the  receiver  site  during  the  K^h  observation  inter- 
val yields  the  information  matrix  update  equation 


A+  = A ± 


Je(xK-l,Xl<)  J3(XK-l’XK) 


(7) 


where  A + is  the  information  matrix  after  adding  a measurement  (+)  or  deleting 
a measurement  (-).  J0(xk~i,xk)  is  a 3N-vector  of  the  form 
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(8) 


Vvi’V  = Co  0 


GGf5l/XK~P  ~ ~ 0] 

<K-l)St  Kth 
block  block 


~T>- 


j^(xK-15xk)  partitioned  into  3-vectors  with  the  only  nonzero  entries  being 
in  the  (k-l)3t  and  kth  positions.  If  A±  is  factored  as  L-^D-L^  then 


L+D+L+T  = LDLT  + Jg(xK1  >XJ<)J3^K„1  >*K) 


(9) 


Equating  block  elements  on  both  sides  of  the  last  equation  gives  the  update 
equations  for  Li,  Lij,  arid  Di  as 


+ 

LT  = q 
DT  - D. 

i l 

± 

Li+l,i  ” Li+l,iJ 


> i<K-l 


(10a) 

(10b) 

(10c) 


LK-1DK-1LK-1  = LK-1DK-1LK-1  G31(xK-ljG6l<'XK-l) 
LK-1DK-1LK,K-1  = LK-1DK-1LK,K-1  1 G81(‘XK-l)Gf52^XK) 


(11) 

(12) 


LKDKLK  = LKDKLK  + LK5K-lDK-lLK5K'l  lk,k-idk-ilk5k-i 


+ G32(xK)GB2^XK^ 


(13) 


L*DTL*1  . = L.D.lT  . K<i<N-l 

1 1 1+1,1  1 1 1+1,1  


(14) 


+ + ±T 


± r±T 


L7D7L . = L.D.LT  + L.  , .Dt  ,Lt  . . - L7  . .D7  ,L7  . , K<i<N 

ill  ill  isi-l  l-l  i,i-l  i,i-l  l-l  1,1-1  ^ 


(15) 


Some  manipulation  of  the  above  equations  reduces  (13)  and  (15)  to  a rank  one 
modification  form  so  that  the  update  equations  become 


> i<K-l 


L*  . = L . J 

.1+1,1  1+1,1 
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(11) 


LK-lDK-lLK-i  ~ LK-1DK~1LK-1  1 G$1(xK-1>G61CxK-1) 

+ + h t T T—  — 

LK-1DK-1LK,K-1  " LK-1DK-1LK,K-1  1 GBl(xK-l)G32(xK) 


± ± ±T  T T 

LKDKLK  = LKDKLK  1 XKXK 


XK  = ^G32(‘XK)  ^.K-A-lSl^K-l^^^K-l* 

qK-l  = GB1<'XK-1^<'LK-1DK-1LK-1^  GB1(xK-1^ 


+ + +T  T 

L7D7L7  . * = L.DfL. , , . K<i<N-l 

l l 1+1 >1  i i 1+1,1  


+ + ±T  T T 

L , D . L . = L.D.L.  ± x . x ♦ K<i<N 

ill  ill  li  — 


T _1T  ti 

xi  = Lisi-iLi-ixi-i/<1±qi-i)  (21) 

qi-l  = Xi-iai-lDi-lLI-l)_lxLl  (22) 

Equations  (11),  (16),  and  (20)  are  solved  for  the  updated  factors  using 
methods  cl  and  c2  of  Gill,  Golub,  Murray,  and  Sanders  [1].  Method  cl  is  used 
for  addition  (+)  because  there  are  many  more  adds  than  deletes  and  cl  is  the 
most  computationally  efficient  of  the  update  methods.  Method  c2  is  used  for 
delete  (-)  since  stability  can  be  a problem  in  this  case.  Equations  (12)  and 
(19)  are  solved  for  the  rows  of  L±|+1 

After  applying  the  above  update  equations  sequentially  to  add  or  delete 
the  3th  receiver  from  all  observation  intervals,  the  effect  of  the  3th  receiver 
on  the  objective  function  C must  be  computed.  The  new  objective  function  C is 
easily  computed  by  obtaining  the  new  values  of  cov(xj.)  from  the  available  L and 
D factors  of  the  information  matrix. 


oov(^)  = l/d-1!,-1 


°°v(;i-l)  = Li-1(  i , i-lcov<;;i 1 Li , i-ltDi-l )Li-l  1»2-N 
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if  ■ RADAR  AND  C I l\L  _ BATCH  mCKSSOR  AND  INFORMATION  MATRIX  UPDATE,  The 
radar  and  cine  batch  processors  minimize 


N M 

Q(Xi,x2,*--^N)  = l l (Za(t.)-ga(xi)-Sotb)T  R^U) 
1=1  a=l 

(Z  (t. )-g  (x. )-S  b) 
a i “o  i a 


(25) 


where  b is  the  composite  bias  vector 


b = CbJ_b2 — bM3 


(26) 


and 


S b = b 
a a 


(27) 


The  information  matrix  of  this  batch  processor  is  a ( 3N+pM)x( 3N+pM)  bordered 
block  diagonal  matrix 


A = 


A, 


\ 


o 


\ 


\ 


\ 


\ 


\ 


AT 

1,N+1  2.N+1 


1,N+1 


2 ,N+1 


AN  AN,N+1 


N+1An)-1 


(28) 


The  Ai  i=0.,N  are  3*3,  Ap+p  is  pMXpM  and  the  Ai^+1  are  3XpM.  These  matrices  are 
functions  of  the  trajectory  position  vectors,  the  measurement  noise  variances 
and  the  partial  derivative  matrices  3ga(xi)/3xi=Ga(xi) , A is  factored  as 
A=LDLT  where  L is  unit  lower  triangular  and  D is  diagonal.  The  simple  structure 
of  A allows  L to  be  chosen  as 
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(29) 


The  Li  are  3*3  unit  lower  triangular,  Ljj+i  5i  are  pMx3,  and  L^+i  is  pMxpM  unit 
lower  triangular.  The  matrix  D is  partitioned  into  N 3*3  diagonal  blocks  Di 
and  one  pMXpM  diagonal  block  D;j+i . The  Li,  Ln+1 ,i,  and  Di  are  computed  from 


L.D.lT  = A.  i-1 ,N 
111  1 

L.D.lJ,  , . = A. 
i i N+l,i  i,N+l 

T 

LN+lDNtlLN+l  = AN+1 


N 


l 


T 

LN+l,iDiLN+l,i 


(30a) 

(30b) 

(30c) 


Adding  or  deleting  the  gth  instrument  site  at  the  Kth  time  point  yields 
the  information  matrix  update  equation 


A±  = A±  J6(xk)Jb(Sk) 


(31) 


where  A±  is  the  information,  matrix  after  adding  a measurement  (+)  or  deleting 
a measurement  (-).  Jg(xK)  is  a (3NtpM)xp  matrix  which  is  partitioned  as 


J^(xK)  = [0  — 0G&(xk)  0 — 0 Sp] 

Kth  (N+l)st 

block  block 

* + + +T 

If  A+  is  factored  as  L”D  L 


(32) 


+ + +t  rr  — T1 

L"D“L"  = LDL  + Jg(xK)  J^(xK) 


(33) 


Equating  block  elements  on  both  sides  of  the  last  equation  gives  the  update 
equations  for  L^ , L^+1  . , and  as 
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L.  ~ L. 
i 1 


+ 

D7  “ D. 
x i 


> i t K 


LN+l,i  = IjN+lsi  J 


L?KLf  = W*  1 G6(;K>Ge<;K> 


VYLH-t-l,K  LKUKLN+1,K  1 GgtxK}SB 


+ ± ±t  T T 

LN+1DN+1LN+1  " LN+1DN+1LN+1  + ln+i ,kdkln+i,k 


(34a) 

(34b) 

(34c) 

(35) 

(36) 

(37) 


+ + +T  T 

-L’  T)“L  + Q c 

N+1,K  K1+1.K  “ *fTg 

The  above  update  equations  are  solved  by  reducing  (35)  and  (37)  to  a sequence 
of  p rank  one  modifications.  Let  G|^(xk)  be  the  jth  column  of  G%(.xv)  and  sX’-i 
the  jth  column  of  ST.  Then  (35)-(37)  reduce  to  p Is 


lkdklk  = lkdklk  1 Sj^K^gj^K^ 


LKDKLN+1,K  = LKDKLNtl,K  * Ggj(xK)Sgj 


± ± ±T  T T 

N+A+l  H+l  Al+lAj+lhl+1  ~ Xjxj 


> j=l»P 


with  T 


xt  = (Si,-L. 


-1„T 


1 


Bj 


N+1,KLK  G6j (xK ) )/(l±q) 


q = Ggj^XK)(LKDKLK)  Ggj^XK‘> 


(38) 

(39) 

(40) 

(41) 

(42) 


+ X 4-  4 

In  the  above  Lk^Lk,  Dk<Dk , Ln+1,k«-Ln+1  ,K,  Lk+j  after  the  jth  rank  one 

inodifxca txon . Methods  cl  and  c2  of  ClJ  are  again  used  to  solve  for*  the  updated 
factors* 


After  applying  the  above  update  equations  sequentially  to  add  or  delete 
the  gl'h  instrument  site  at  all  trajectory  time  points,  the  effect  of  the  gth 
instrument  on  the  objective  function  c must  be  computed.  The  new  objective 
function  C is  easily  computed  by  obtaining  the  new  values  of  cov(xj)  from  the 
available  L and  D factors  of  the  information  matrix. 
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(42) 


LN+lPi  “ LHfl,i 

cov(^)  = ^(dTVpTd’^P.Jl"1  1=1, N (43) 
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A COMPUTER  SOLUTION  OF  THE  BUCKINGHAM  PI  THEOREM 
USING  SYMBOLANG,  A SYMBOLIC  MANIPULATION  LANGUAGE* 


Morton  A.  Hirschberg 
US  Army  Ballistic  Research  Laboratories 
Aberdeen  Proving  Ground,  Maryland  21005 


ABSTRACT 


Among  the  theories  of  similitude,  application  of  the  Buckingham  Pi 
Theorem  allows  one  to  find  meaningful  relationships  among  variables,  check 
the  formulation  of  a system  of  equations,  and  allow  prediction  from  scaled 
parameters.  Simply  stated,  the  Pi  Theorem  asserts  that  if  there  are  n 
variables  involving  N fundamental  units,  these  may  be  combined  to  form  n-N 
dimensionless  parameters  each  containing  N+l  variables. 

SYMBOLANG  is  a FORTRAN-based  symbol  manipulator  using  a list  structure. 
Symbol  manipulators  operate  on  strings  of  symbols  rather  than  numbers.  As 
an  example,  a symbol  manipulator  allows  multiplication  of  N+l  by  N-l  to 
obtain  N^-l,  A list  structure  allows  data  to  be  stored  and  manipulated  by 
relationships  rather  than  sequentially  or  by  some  other  scheme. 

The  solution  of  the  Pi  Theorem  involves  forming  Pi  Terms.  This  is  done 
by  the  investigator  ordering  the  equations  of  the  system  in  order  of  impor- 
tance. The  equations  are  then  exponentiated  and  multiplied  together  in 
groups  depending  upon  the  sizes  of  n and  N.  Once  a Pi  Term  is  formed,  the 
dimensionless  set  is  solved  for  by  setting  the  exponents  of  the  parameters 
to  zero  and  using  any  routine  which  will  solve  a linear  set  of  equations. 

The  numeric  solution  is  then  paired  with  the  parameter  to  which  it  belongs. 

Planned  extensions  of  this  work  Involve  tabling  well  known  dimension- 
less numbers  and.  checking  generated  solutions  against  the  tabled  set. 
Furthermore,  data  for  the  parameters  can  be  input  and  full  numeric  answers 
obtained.  In  addition,  the  computer  can  generate  a large  number  of  solu- 
tions by  permutting  the  equations.  Permutted  solutions  can  be  factor- 
analyzed  and  a regression  fit  made  to  determine  a "best-solution." 


♦This  article  appeared  as  BRL  Report  No.  1824. 
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The  method  of  dimensions  is  somewhat  dangerous  hut  when  used 
with  proper  aare3  it  is  unquestionably  of  great  power  and 
value . 1 


I.  INTRODUCTION 

The  theory  of  dimensions  has  a long  history  of  development 
dating  back  to  the  ancient  Greeks,2  Galileo,  Newton,  Fourier,  Lord 
Rayleigh,  Vaschy,  Buckingham,  and  Bridgeman  are  a few  of  the  men 
instrumental  in  the  development  of  dimensional  analysis. 

The  theory  of  dimensions  is  important  when  we  wish  to  compare 
results  for  two  arbitrarily  selected  systems  of  units,  It  is  useful 
in  areas  where  knowledge  is  developing  through  an  intermediate  stage, 
when  basic  laws  are  already  known  but  problem  solutions  are  lacking. 
Its  application  may  yield  interesting  conclusions  based  upon  general 
physical  assumptions  which  are  themselves  uninteresting.^ 

The  theory  of  dimensions  has  three  important  applications4: 

1*  It  supplies  one  with  useful  checks  against  errors  made  in 
calculations. 

2.  It  suggests  forms  of  physical  laws. 

3,  It  allows  the  prediction  of  behavior  of  a full-scale  system 
from  the  behavior  of  a model. 

The  rest  of  this  paper  will  deal  with  all  three  applications; 
however,  the  very  nature  of  the  method  employed  guarantees  the  first 
application,  and  planned  extensions  of  the  model  will  form  the  basis 
for  a future  report  of  the  third  application. 


^Strutt,  J.W.,  Baron  Rayleigh,  The  Theory  of  Sound,  Vol,  I,  New  York, 
Dover  Publications,  1945. 

2 

Macagno,  E.O.,  Historico-Critical  Review  of  Dimensional  Analysis, 
Journal  of  the  Franklin  Institute,  292,  391-402,  1971. 

3 

Sedov,  L.I.,  Similarity  and  Dimensional  Methods  in  Mechanics,  New 
York,  Academic  Press,  1959.  ' — 

4 

Synge,  J.L.,  § Griffith,  B.A.,  Principles  of  Mechanics,  New  York, 
McGraw-Hill,  1942. 
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II.  THE  BUCKINGHAM  PI  THEOREM 


In  dealing  with  the  equations  of  physical  phenomena,  one  often 
multiplies  and  divides  various  quantities  and/or  parameters.  The 
rules  of  combination  are  essentially  inherent  in  the  parameters  at 
hand.  If  two  physical  quantities  possess  the  same  dimensionality, 
we  say  their  quotient  is  dimensionless  or  non-dimensional.  Pi  terms 
are  the  names  given  to  such  non-dimensional  products  or  quotients  of 
the  original  parameters.5 

As  a general  rulej  it  takes  N+l  variables  to  form  a dimension- 
less term  with  N fundamental  units.  This  leads  one  to  a simple 
statement  of  the  Pi  Theorem: 

If  there  are  n variables  involving  N fundamental  units,  they 
can  be  combined  to  form  n-N  dimensionless  parameters,  each  contain- 
ing N+l  variables,6 

The  Pi  Theorem  was  first  proved  by  Buckingham7  and  is  often 
referred  to  as  the  Buckingham  Pi  Theorem, 

As  an  example,  all  the  quantities  in  mechanics  can  be  expressed 
using  three  fundamental  units,  e.g.  force  (F),  length  (L) , and  time 
(T) . All  quantities  in  mechanics  can  be  expressed  in  the  form 

pa  ^3  py  where  a,  6,  and  y are  positive,  zero,  or  negative  powers 
which  are  not  necessarily  integers.  The  following  is  a list  of  a 
few  of  the  dimensions  which  can  be  formed: 


Baker,  W.E.,  Westine,  P . S . , S Dodge,  F.T.,  Similarity  Methods  in 
Engineering  Dynamics:  Theory  and  Practice  of  Scale  Modeling,  New 

Jersey,  Spartan  Books,  1973. 

^Housner,  C.W.,  § Hudson,  D.E.,  Applied  Mechanics  Dynamics,  New  York, 
van  Nostrand,  1950. 

7 

Buckingham,  E.,  On  Physically  Similar  Systems:  Illustrations  of  the 

Use  of  Dimensional  Equations,  Physical  Review,  2,  345-376,  1914. 
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[Velocity]  = LT"1 
[Acceleration]  = LT~2 
[Force]  - F 

[Moment  of  a Force]  = FL 
[Linear  momentum]  - FT 
[Angular  momentum]  = FLT 
[Density]  = FT2  L"4 
[Power]  = FLT’'1 
[Pressure]  = FL*2 
[Viscosity]  = FL"2  T 

where  the  "="  is  a true  equality* 

The  solution  of  a dimensional  set  of  equations  involves  forming 
Pi  Terms  and  solving  the  system  of  equations  formed  by  the  resulting 
coefficients  of  Pi  Terms.  That  is,  one  solves  equations  of  the  form 


^ii  + + + _ ® 

where  each  equation  is  the  coefficient  of  L,  F,  and  T respectively. 
One  solution  is  formed  for  each  possible  Pi  Term. 

A computer  program,  described  in  a later  section,  has  been 
developed  to  form  the  Pi  Terms  and  solve  the  resulting  equations 
which  render  the  system  dimensionless. 


III.  FORMULA  MANIPULATION 

Formula  manipulation  (or  symbolic  manipulation)  is  primarily 
processing  of  non-numeric  data.  Several  languages  exist  for  such 
purposes. ® Basically,  a formula  manipulator  is  written  as  a specific 
list  processing  language,  or  language  which  processes  strings  of 
symbols . 


8 

Sammet,  J.t*,  Survey  of  Formula  Manipulation,  Communications  of  the 
Association  for  Computing  Machinery,  9^  555-569,  1966. 
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The  language  selected  to  form  Pi  Terms  and  solve  the  system  of 
coefficient  equations  was  SYMBOLANG . 9 > 1 0 SYMBOLANG  is  written  almost 
completely  in  FORTRAN  and  is  itself  an  extension  of  SLIP, 1 1 1 11  > 1 3 a 
list-processing  language  also  written  in  FORTRAN. 

The  combined  SYMBOLANG -SLIP  system  comprises  nearly  180  sub- 
programs and  allows  for  arithmetic  operations  (addition,  subtraction, 
multiplication,  division) , exponentiations,  substitutions,  evaluations, 
and  differentiations. 

Lists  may  be  created,  manipulated,  and  destroyed,  with  storage 
of  destroyed  lists  being  reusable.  In  addition,  recursive  or 
repetitive  processing  is  permitted  in  a limited  sense  (using  the 
computed  GO  TO  statement) , In  toto,  the  SYMBOLANG-SLIP  system  is 
powerful  and  easily  extendable.  No  new  language  is  required  to  use 
the  system.  One  need  be  familiar  with  FORTRAN  and  acquainted  with 
the  subprograms  comprising  SYMBOLANG  and  SLIP. 


IV.  METHOD  OF  SOLUTION 

The  number  of  parameters  and  the  parameters  of  the  system  are 
input  and  displayed.  This  is  done  for  completeness.  Next,  the 
number  of  equations,  n,  and  the  equations  of  the  system  are  input. 
The  latter  are  input  as  SYMBOLANG  lists  and  are  assumed  to  be 
ordered  with  those  equations  deemed  most  important  first.  That  is, 
earlier  entered  equations  are  more  likely  to  produce  fruitful 
results.  The  equations  are  displayed.  The  number  of  fundamental 
units,  N,  is  also  an  input.  From  n and  N,  the  number  of  Pi  Terms 
to  be  produced  is  calculated. 

The  equations  are  then  partitioned  into  new  SYMBOLANG  lists 
(herein  called  factors)  containing  the  portion  to  the  right  of  the 
equal  sign.  Similarly,  the  portion  to  the  left  of  the  equation  is 
also  retained  (this  will  now  be  referred  to  as  a variable) . Factors 
involve  fundamental  units  of  the  system. 


9 

Finder,  N.V.,  Pfaltz,  J.L.,  5 Bernstein,  H.J.,  Four  High-Level 
Extensions  of  FORTRAN  IV:  SLIP,  AMPPL-II,  TREETRAN,  SYMBOLANG, 

New  York,  Spartan  Books,  305-387,  1972. 

^Hirschberg,  M.A.,  SYMBOLANG-A  SLIP  Extension  for  Albegraic  Manipu- 
lation, Ballistic  Research  Laboratories  Report  No.  1749,  Nov  1974. 
n(AD  #A003190) 

Weizenbaum,  J.,  Symmetric  List  Processor,  Communications  of  the 
Association  for  Computing  Machinery,  6_,  524-544,  1963. 

12 

Reference  9,  pp  1-82. 

13  ' 

Hirschberg,  M.A.,  SLIP  for  the  BRLESC  II  COMPUTER,  Ballistic  Research 

Laboratories  Report  No.  1731,  July  1974.  (AD  #A000653) 
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Pi  Terms  are  formed  by  systematically  raising  each  newly  created 
factor  to  a power  (powers  are  expressed  as  variables  for  solution, 
that  is  A,  B,  C,  etc.)  and  then  multiplying  the  exponentiated  factors 
together. 

There  is  always  one  factor  in  each  Pi  Term  that  has  as  its 
exponent  1.  That  is,  it  is  not  raised  to  a power,  but  is  used  as 
is,  while  all  the  other  factors  are  exponentiated.  This  selection 
of  primary  factors  is  made  systematically. 

Once  the  product  term  has  been  formed  (all  exponentiated  factors 
multiplied  together),  the  product  is  displayed.  It  is  of  the  form 

FACTOR®1  FACTOR-"2  FACTOR  “3  . . . FACTOR  "n 

1 2 3 n 

where  ai  = Cj  + C2  A + B + C... 

i i i i 

and  is  a signed  integer  0,  ± 1,  ± 2,  ± 3,  etc. 


The  equations  are  solved  by  setting  the  ai's  to  zero  and  then 
evaluating  the  resulting  set  of  simultaneous  linear  equations  (this 
can  be  done  using  any  matrix  inversion  routine  where  the  solution 
vector  is  calculated) . 

The  solutions  of  the  simultaneous  set  of  equations  are  then 
paired  with  their  proper  variable.  This  completes  both  the  symbolic 
and  numerical  portions  of  the  solution  of  the  Pi  Theorem. 

Appendix  I contains  the  computer  listings  for  solving  the 
Buckingham  Pi  Theorem. 


V.  A SAMPLE  PROBLEM 

The  following  problem  is  adapted  from  Housner  and  Hudson.  Let 
us  consider  the  drag  force  F acting  upon  a body  moving  through  a 
fluid.  Let  us  assume  a constant  velocity  v through  a fluid  of 
density  p and  viscosity  y.  The  analysis  is  to  apply  to  bodies  of 
a specified  shape,  so  the  cross-sectional  area  A may  be  used  as  a 
measure  of  the  body's  size.  The  following  variables  and  fundamental 
units  enter  into  the  problem; 
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Variable 

Fundamental  Unit 

F 

= F 

y 

= FL'2T 

A 

= L2 

p 

= FL~4  T2 

V 

1 

il 

Two  Pi  Terms  can  be  formed  from  this  set.  Using  Force  and 
Viscosity  as  the  unique  variables  for  solution,  we  arrive  at 


H1  - F AA  pB  v^  and  TT^  = uAA  v^ . 


Expanding  these  equations  in  terms  of  the  fundamental  units  and 
solving  the  exponential  equations,  we  arrive  at  the  following 
solution: 


H.  » — and 

1 »pv2 


n2  ■ 


vA1/2p 


Table  I shows  the  computer  inputs  and  outputs  for  the  problem 
above.  Each  Pi  Term  is  developed  and  displayed.  Following  the  Pi 
Term  is  the  numerical  solution  of  the  exponents  giving  rise  to  the 
dimensionless  set. 


VI.  DISCUSSION 

The  presented  computer  solution  of  the  Buckingham  Pi  Theorem 
provides  one  solution  for  a physical  system.  It  is  not  necessarily 
an  optimum  or  "best”  solution.^4* 15  Similarly  the  computer  solution 


14 

Blau,  G.E.,  Optimization  of  Models  Derived  by  Dimensional  Analysis 
Using  Generalized  Polynomial  Programming,  Journal  of  the  Franklin 
Institute,  292,  519-526,  1971. 

^Chen,  W.K.,  Algebraic  Theory  of  Dimensional  Analysis,  Journal  of  the 
Franklin  Institute,  292,  403-422,  1971. 
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TABLE  I 


INPUTS  AND  OUTPUTS  FOR  THE 
COMPUTER  SOLUTION  OF  THE  BUCKINGHAM  PI  THEOREM 


NLJMRpB  OF  PRIMITIVES 


•I  THEOREM  silver 


PRIMITIVES  ONLY  USED  FOR  COMPLETENESS 
FORCED*  " 

VISCOSITY* 

AREAS 

density*  I INPUTS 

VELOCITY* 

FORCE* 

lengths 

TIMES 


THERp  ARE  * FORMULAS  INVOLVING  3 

FORCED "FORCE** 

V I SCOSI TY«FORCE*T 1 ME /LENGTH* *2** 
AREA«LEM0TH**2SS 

DENS  I TY»EORCF*T I ME**2/LENGTH**4«* 
VEL0CITY»LFNGTH/T1ME*« 


V AR  I A«l  p* 

* INPUTS 


THERF  ARE  2 PI  TERMS 

LTV  ■ FORCE** ( J + fl ) -LENGTH** ( 2* A - 4*8 

*TIMF**I?*B  - C> 

S 

SEND  OF  EXPRESSION 

SOLUTION  FOR  PI  TFRM 

forcfd  **  .lnoonoooE  01 

area  **  -.inooooonE  ni 

density  **  -.lnononone  ni 

VELOCITY  **  -,2nnononnE  01 

LTV  ■ Force**!!  + 8 ) -LENGTH** ( -2  + ?*A 

*B  * C)*TIME**(1  + 2*  B - C» 

* 

SEND  OF  EXPRESSION 
SOLUTION  FOR  PI  TERM 

viscosity  •*  .inonooonfc.  oi 

area  **  -.sonnonoDt  oo 

density  **  lnonoounF.  m 

velocity  **  i nnnoonnE  ni 
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might  need  to  have  a transformation  applied  to  it  to  put  it  in  a 
form  where  it  proves  more  usable  (i.e.  cubing  the  terms  of  one  of 
the  Pi  Terms,  etc* ) . 

If  the  user's  insight  was  good,  the  numerical  solution  (results 
obtained  by  using  values  for  the  parameters)  one  would  obtain  would 
be  the  same  as  those  for  an  optimum  or  "best"  solution* 

The  planned  extension  of  this  work  is  to  provide  data  for  the 
variables  and  calculate  a Buckingham  Pi  Theorem  solution.  Next 
apply  numerical  values  and  save  the  results*  Next  calculate  a new 
Buckingham  Pi  Theorem  solution  and  again  apply  numerical  values  to 
this,  saving  the  results.  The  computer  can  be  programmed  to  do 
this  for  a great  many  of  the  total  possible  Pi  solutions.  The 
numerical  results  can  then  be  factor- analyzed  and  a regression  fit 
made.  The  numerical  data  can  then  be  fed  back  through  each  Pi 
solution  and  solutions  closest  to  the  regression  can  be  saved  for 
later  analysis*  In  this  manner,  a quasi-empirical  best  solution 
can  be  obtained  for  further  study  and  use  in  later  analysis  of  the 
system  under  consideration* 

Furthermore,  common  non-dimensional  numbers16  may  be  tabled 
and  as  computed  solutions  are  formed,  these  may  be  compared  to  the 
tabled  values*  If  a match  is  found  this  can  be  noted,  as  it  is 
of  considerable  use  to  know  if  a computed  Pi  Term  is  one  of  the 
common  dimensionless  numbers* 


16Land,  N.S.,  A Compilation  of  Non-dimensional  Numbers,  Washington, 
D.C,,  U.S*  Govetment Printing  Office,  NASA  SP-274,  1972* 
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APPENDIX  I 


Computer  Listing  of  the  Buckingham  Pi  Theorem  Solution 


268 


oooo  r>  noo  n r»  r>o  o o n r> 


PROGRAM  RUCKY 


THIS  IS  THE  PI  THEOREM  SOLVER 
WRITTEN  BY  M A HTRSCHBERG 

October  ip74 

SET  UP  SLIP  STORAGE 

COMMON  AVSL,X(100> 

dimension  sp(ionoo) 

SET  UP  PROGRAM  STORAGE 

n l MEN  Slow  LNEW( l00l,LTEMP(10ni,LP0WER< 100) 

DIMENSION  LPRIMSI 100) ,LFORMS( 100) 

DIMENSION  SOLTN  t 24 ) 
n I MENS  I ON  I7EEI78I 
DIMENSION  FEORM ( 100) 

DIMENSION  LFXPI7R) 

DATA  ONE/1,/ 

DFF1NE  EXPONENTS  A-7#Al-7l, A2-72 

data  izff/iha, ihb, ihc, ihd, ihe, ihf, jhg, ihh, jhi, ih.j, ink, ihi , jhm, hn, 

1 1 HO, 1 HP, 1 HO* 1 HR, 1HS, 1 MT, 1 HU, 1HV, 1 HR » 1 MX. 1 HY, 1H7, 

22HA1  ,?HR1 ,2HCI  ,2H01  ,2HE«.  ,?HF1 ,2Hgl  ,?HHt  , 2H  t 1 , 2H.T  i , ">hk  1 , 2HL 1 , ?Hi  l , 
S?HNl,2HOl,2HPl,2H01,2HRl,2HSl,2HTl,2Hlll,2HVl,?Hwi,?HXt,?HYl,?H7  1, 
4j>MA2>,  2HG2»  2HC2, 2Hn2,  2HE2,  2HF?,  ?HG?,  2HH? , pH  I 2,  pH  T? , 2MK  ? , 2HL 2 , ?H U ?, 
5?HN?,  ?H0?,?Hp?,2HQ2,2HR?,?HS?,2HT2,2M|l2,2Hv2,2Hu'?,?HX2,?HYP,?H7?/ 

SETUP  WORKING  STORAGE 

CALL  INITASI SP, 10000 ) 

READ  NUMBER  OF  PRIMITIVES 
RE  A 0 (5,10)  NPRMS 
tO  FORMAT  (IS) 

WRITE  (6,12)  NPRMS 

1?  FORMAT < 1H1.S5X, 18H  PI  THEOREM  SOLVER/ 

1 2 1 H NUMBER  OF  PRIMITIVES, 4X, 15) 

WRITE  (6,14) 

14  FORMAT  (38H0PRIMIT1 VES  ONLY  USED  FOR  COMPLETENESS) 

READ  PRIMITIVES 

DO  20  1-1, NPRMS 

LPRIMSII )-INLIST( LPRIMS( I ) ,SH INPUT, PRO) 

2D  CONTINUE 
OUTPUT  PRIMITIVES 

DO  30  1-1, NPRMS 

call  lsOpnt ( lpRI ms( i ) , i ohprimit Ives, prp.,temp) 

30  Continue 

C REAn  NUMBER  OF  FORMULAS 
REAn  (S,10)  NF  RMS 

C READ  NUMBER  OF  INDEPENDENT  VARIARLES 
READ  (5,10)  I VARS 
WRITE  (6,32)  NERMS, IVARS 
32  FORMAT  (10H0THERE  ARE , 1 X , I 5 , 1 X , 8 HF ORMUL A S , 

1 IX, PH  INVOLVING, IX, I 5, IX, PH VARIABLES) 

C READ  formulas 

DO  40  I - 1 ,NFRMS 

CALL  ini  ISTILFORMS* I),5H1NPUT,3HVAL,TEMP) 

40  CONTINUE 
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no  oo  oo  on  on  ooooono  o o 


calculate  the  number  of  pi  terms 

I P I -NFRMS- I APS 
WRITE  (6,42>  IPI 

4?  FORMAT  ItOHOTHERE  ARE, IX, 15, IX, BMP!  TFRMS) 

OUTPUT  FORMULAS 

no  50  1-1, NFRMS 

CALI  LSOPNT (LFORMSI I ) , ^FORMULAS, 990. , TF«P ) 

50  CONTINUF 

STRIP  EQUALS  OFF  ANO  SETUP  NEW  SYMBOLAMG  LISTS 
IXOT-O 

no  ?no  i-i, nfrms 

SET  UP  READFR  for  FORMULAS 
LRn«LRDROV(l FORMS!  I )) 

STRIP  FIRST  PART  of  FORMULA 
no  fin  ii»i,a 

OAT UM»AOvSER(LR0, FLAG) 

IF  (I! ,NF.S>  00  TO  60 
IXOT-IXPTM 

save  variable  name  in  formula  for  later  use  (Solution) 

FFORMI  I x l)T  ) « n ATUM 
fiO  CONTINUE 
| Man 

LW-LISTUw) 

ICan 

65  CONTINUE 

ADVANCE  THRoUOh  list 

n*TUM-AnvSFR(LRn,FLAG) 

IE  (FLAO.NF.O.)  GO  TO  70 

set  up  temporary  list  and  count  elements 

JC-1CM 

CALL  NEWBOT(nATUM,LW,TEMp> 

GO  TO  65 
70  CONTINUE 
C 

C SET  COUNT  And  form  NEW  new  SVMRoi AMG  LIST 
LNEW(  I ) all 

LNFW( ! ) -LI  ST ( LMEw< I ) ) 

LROaLRpROV(LW) 

LTEMP ( 1 ) »L ! ST ( 9 ) 

CALL  newbotj  LTEMP( I ) , LNEw ( I ) , TEMp ) 
l.C-IC-2 

IF  ILC.LE.O)  GO  TO  2P00 

no  SO  Ja 1 , LC 
nATHMaAnVSER(I.RD,FLAG) 
call  newboti datum, LTEMP( I ) ,TFMP) 
sn  CONTINUE 

C 

C ERA8F  TEMPORARY  LIST  ANP  PRINT  NEW  LjST 
CALL  IRALST(LW) 

C CALL  LSOPNTILNEWI I ),4HLNEW,Q99.,TEMP) 

200  CONTINUE 
KaNFRMS-1 
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no  71 n I-1,K 


C SETUP  ExPONFNTS 

LEX Pit I-LSQMNt (LSUHM3( I I ZEE (I) , l.  II 
210  CONTINUE 

C 

C SETUP  PI  TERMS 

LERMS-LSQMN1 CLSQMNl ( 1 . ) ) 

no  76n  i - 1 , t p i 

LTVwLSOMNl CLSOMNi < 1.  ) I 

KK«n 

C CYCLE  THROUGH  FORMULAS  MOST  IMPORTANT  ANfl  SKIP  FORMULAS  SO  As 

c to  include  only  the  important  formulas  one  at  a time 
no  245  .!■  I » NFRMS 
IF  ( J.EO.I  ) GO  TO  215 
IF  (J+IVARS.LE. NFRMS)  GO  TO  245 
KK-KKM 
C 

C RAISE  POWER 

LPOKIERIKK  )«L$ORAZ(LNEW(.t),LEXP<KK)  ) 
I..TU«LSOMFX(LTv*LPOwERIKK)  ) 

CALL  LSonFSILTV, TEMP) 

TLGn-SEURnRILTU) 
l TV-LSUCPYITLRD) 

CALL  LSonESILTUfTEMPJ 
GO  TO  245 
2 1 5 CONTINUE 

CALL  LSOPESILTV,TEMP) 

LTv»LSOmEX(LFRMS,LNEw( JI  ) 

24S  CONTINUE 

CALL  LSQPNT  <LTV,3HLT  v,9<*9.  ,TFMP) 

C CALL  PRLSTS ( LT V#  4 ) 

C 

C Sol.VF  FOLIATIONS 

CALL  BUCKS VC  LTV, I ZEE# SOI TN, I COUNT) 

CALL  LSOnES(LTv»TEMp) 

nn  ?so  j«i,kk 

CALL  LSOPFSCLPOWERC J) ,TEMPj 

230  CONTINUE 
C 

c print  solution  for  pj  term 

WRITE  18,251) 

231  FORMAT ( 21HOSOLUTION  FOR  PI  TERM  1 
KK«0 

00  238  J-l, NFRMS 

IF  CJ.EO.I)  write  CB,253)  FFORMC!),ONF 
253  FORMAT  C1H  , A I 0# 1 X# 2H** # 1 X, El  A . S ) 

IF  (J+IVARS.LE. NFRMS)  GO  TO  258 
KK-KK+L 

WRITE  (A, 253)  FFORMC  J) #SOLTN(KK 1 
233  CONTINUE 
230  CONTINUF 
CALL  EXIT 
2000  CONTINUE 

WRITE  (3,2010) 

2010  FORMAT  <7H  NO  NO  ) 

CALL  EXIT 
END 

SUBROUTINE  BUCKSVILIST,  ZEE#S0LTN. ICOUNT ) 

C 
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C THIS  SUBROUTINE  SETS  UP  THE  SOLUTION  FOR  THF  PI  THEOREM 
C WRITTEN  BY  MA  H1R3CH8ERG 
C JANUARY  1975 
C 

D I HENS  J OH  7EEI78) 

0 I MENS  1 ON  SOLTN ( 24 ) 

0 1 MENS  I ON  AMAT (24,24) 

DIMENSION  A W 0 R IM  6) 

C 

C CLEAR  STORAGE 

DO  5 t-t,24 
SOLTN ( f ) *0,0 

no  4 J-l , 24 
AMAT( I , J ) -0 . 0 

4 CONTINUE 

5 CONTINUE 

C 

C SET  UP  READER  E OR  LIST 
LR-LRDROV(LIST) 

C SET  FLAGS 

LEVEL-0 
IC0UNT«1 
IFNn»n 
IWORO-O 
I G 0 ■ 0 
JEND-1 
10  CONTINUE 
I G0« I S0+ 1 
jgo»o 

C ADVANCE  THROUGH  LIST 
X-ADVSWR ( LR,  K ) 

IE  IK)  100, 20, 100 
?o  IF  (LEVFI.-LCNTRILR)  ) 150,30,70 
10  IF  (NAMTSTMO)  60,40,50 
40  IF  I L l STMT ( X I I 50,10,50 
C WE  HIT  A SuRLIST 
50  CONTINUE 

LEVEL-LEVEL M 
IEN0-0 
I WORD-0 
GO  TO  10 

C WE  MIT  A DATUM  ELEMENT 
60  CONTINUE 
IENn.nO 

IF  (100, LE, 3)  GO  TO  10 
IF  IJ80.EQ.1)  I COUNT- 1 COUNT ♦ 1 
IF  (JGO.FQ.n  GO  TO  ID 
I WORD- I W0RD*1 
C store  DATUM 

AWORDI I WORD ) -X 
GO  TO  ID 

C WE  HIT  AN  END  OF  SUBLIST 
70  CONTINUE 

LEVEL-LEVFl-1 

lEND-IEND+I 

IF  I IEND.LT, 2)  GO  TO  75 
JGO-1 
GO  TO  20 
75  CONTINUE 

IF  I IwORO.GT.n  GO  TO  80 
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IF  ( IWORD.LF.O)  Go  to  2n 
C STORE  NUMERICAL  COEFFICIENT  (CONSTANT  TERN ) 
ROLTN(  ICOIINTJ—AWORIK  1) 

I W 0 R I)  ■ 0 
SO  TO  20 
SO  CONTINUE 

C STORE  MATRIX  COEFFICIENT 
no  90  I * 1 # 7 s 

IF  ( AWflRlH?)  ,NE.  ?EEU)>  go  TO  90 
AM  A T ( I COUNT  , I )»AI*oRD(  1 ) 

I WORP-O 

SO  TO  (20,1*01,  JENO 
90  CONTINUE 
95  CONTINUE 

CAUL  SI.PERRUOH  BUCKSV  ) 

100  IF  (LEVFL-t CNTR(LR) ) 150,120,110 

no  continue 

level»lfvfi-i 

on  to  ino 

120  CONTINUE 

CALL  RCEIL(IR) 

150  CONTINUE 
JFND-2 

so  to  an 

160  CONTINUE 

INVERT  MATRIX  TO  FIND  NUMERICAL  SOLUTION 

CALL  MAT1NV<  AM A T, J COUNT, SOL TN, 24, 1 , OET  ) 
IE  (OET.EO.O.)  GO  TO  95 
RFTIIRN 

FNO 
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- PIPS  - 

AN  INTERACTIVE  GRAPHICS  PROGRAM 
FOR  DETERMINATION  OF  MASS 
PROPERTIES  OF  IRREGULAR  PLANAR  SOLIDS 


R.  I.  ISAKOWER 
AND 

F.  R.  TEPPER 

PICATINNY  ARSENAL,  DOVER,  N.J. 


ABSTRACT 

PIPS  is  an  interactive  computer  graphics  program 
for  the  calculation  of  the  mass  properties  of  irregular 
planar  solids.  This  program  utilizes  the  graphics 
terminal  to  determine  the  mass,  center  of  gravity,  mass 
moments  of  inertia,  products  of  inertia,  principal  axes 
and  center  of  percussion  of  assemblies  of  irregularly 
shaped  parts  (not  necessarily  touching  nor  in  the  same 
plane)  with  respect  to  arbitarily  selected  3-axis  coordi- 
nate systems.  The  solids  may  either  be  drawn  at  the 
graphics  screen,  or  described  and  read  in  on  punched  cards, 
or  generated  by  attaching  (and  modifying)  an  existing  file 
of  parts.  The  inputted  solids  may  be  accumulated  with 
other  solids  and  their  combined  properties  calculated. 

This  paper  describes  the  techniques  used  in,  and 
operation  of,  the  PIPS  program  along  with  examples  of  the 
successful  application  of  PIPS  to  provide  engineering 
support  to  the  design  analysis  of  problems  associated  with 
hardware  components  and  assemblies. 
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Properties  of  Irregular  Planar  Solids 


THE  PROBLEM 


A.  Background 

PIPS  is  an  interactive  computer  graphics  program  for 
the  calculation  of  the  mass  properties  of  generalized  shaped 
planar  solids.  The  applications  of  this  program  at  Picatinny 
support  the  dynamic  evaluations  of  fuze  components  and 
assemblies  and  missile  and  shell  warhead  sections.  The  need 
for  this  program  illustrates  the  fact  that  even  theoretically 
basic  and  familiar  engineering  calculations  are  at  times, 
not  only  difficult,  but  nearly  impossible  to  accurately  and 
expeditiously  accomplish  in  a design  room  environment. 
Calculations  for  mass,  CG,  and  inertial  terms  are  familiar 
to  engineers,  designers,  and  students,  but  since  actual 
design  components  perversely  do  not  conform  to  "nice" 
uniform  shapes,  determining  these  properties  of  munition 
parts  is  a difficult  and  imposing  job.  This  program  offers 
a considerable  savings  in  development  time  (and  expense) 
and  permits  the  analyst  to  "stay  on  top"  of  the  changing 
design,  thus  optimizing  final  hardware  performance. 

There  are  no  restrictions  to  the  contours,  materials, 
assemblies  of  the  planar  solids,  and  combinations  of  com- 
plicated convoluted  shapes  and  materials  that  are  routinely 
handled  by  the  PIPS  program. 

Any  discussion  of  a computer  program  should  rightfully 
include  a description  of  the  computer  system  upon  which 
it  operates.  There  are  two  graphics  systems  in  use  at 
Picatinny  Arsenal:  a Control  Data  Corporation  274/1700/6500 

refresh  graphics  facility  (fig. 2)  and  a network  of  Tektronix 
4014  storage  tube  graphics  stations  (fig. 4). 

PIPS  was  originally  written  for  the  Control  Data  Corpora- 
tion (CDC)  facility  operating  under  Scope  3.4.1,  IGS  Version 
2,  employing  30  overlays  and  46K  octal  of  60  bit  words  of 
storage.  The  program  was  later  rewritten  for  the  Tektronix 
Terminal  Control  System  (TCS)  package,  and  required  13  over- 
lays and  56K  octal  storage.  The  CDC  274  refresh  graphics 
terminal  is  a twenty  inch  diameter  screen  with  a light  pen 
and  alphanumeric  and  function  keyboards.  It  is  driven  by  a 
mini-computer,  a CDC  1700,  which  is  a satellite  to  the  main 
or  host  computer,  the  CDC  6500,  which  "houses"  the  problem 
programs.  The  problem  programs  are  those  programs  (stress 
analysis,  circuit  design,  etc..)  to  which  graphical  techniques 
are  being  implemented. 

The  Arsenal's  network  of  Low  Cost  Graphics  Terminals 
(LCGT)  are  Tektronix  4014  storage  tubes  driven  directly  by 
the  CDC  6000.  They  feature  a nineteen  inch  diagonal  screen, 
a thumb  wheel  controlled  cross-hair  cursor,  and  an  alpha- 
numeric keyboard.  Work  at  the  tube  is  supported  by  30"x40" 
data  tablets  for  digitizing  input  and  "quick  look"  electro- 
static copying  devices. 
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PRESENT  LARGE  SCALE  IG  SYSTEM 


CDC  274/1700/6500  FACILITY 

• 274  DIGIGRAPHICS  CRT  CONSOLE 

• 20  INCH  DIAMETER  SCREEN  (REFRESH) 

• 4096  UNITS  OVER  20  INCHES 

• LIGHT  PEN 

• ALPHANUMERIC  KEYBOARD 

• FUNCTION  KEYBOARD 

• 1700  DIGITAL  COMPUTER 

• SATELLITE 

• ITEM  IDENTIFICATION  HANDLER 

• GRAPHIC  DRIVERS 

• CARD  READER,  LINE  PRINTER,  CARD  PCH 

• 6500  (HOST)  DIGITAL  COMPUTER 

€ PROBLEM  PROGRAM 


FIGURE  1. 
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PRESENT  LOW  COST  GRAPHICS  TERMINAL 
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FIGURE  3. 


FIGURE  4.  LOW  COST  GRAPHICS  STATION 


B*  Method  of  Solution;  THE  PIPS  ALGORITHM 

P:tPS  employs  what  is  termed  a "slice  method":  the 

solid  whose  properties  are  to  be  determined  is  sliced 
by  the  designer  with  a series  of  selectively  spaced 

-PlQneS  Parallel  slices  or  planar  solids  (fig.  5} 

These  slices  are  handled  by  the  PIPS  code  as  separate  9 
Pieces  whose  properties  are  to  be  calculated.  The  designer 

asYdesiredC>tometth  ^ pieces  in  any  and  as  many  combinations 
new  either  reconstruct  the  solid  or  even  design  a 

Sell  ai  ?hn=Ih%maSS  pfoperties  of  the  individual  pieces  as 
well  as  those  of  any  chosen  combination  of  pieces  are 
calculated  by  the  PIPS  program. 

. . Contour,  thickness,  density,  and  location  relative  to 
x-  plane  of  the  graphics  screen  comprise  the  input  data 

of  theirevev  £he  pi^es  are  described  to  PIPS  in  terms 

of  their  X,Y , and  Z coordinates  in  the  graphic  screen's  3 
ortnogonal  axis  system,  with  the  origin  at  the  canter  of 
T^e  Z axis  is  normal  to  the  screen  and  the  X 

wSJfiiin  ' hoflzontalfy  to  the  right)  and  Y (positive, 
ertically  up)  axes  lie  in  the  plane  of  the  graphics 
screen.  The  pieces  are  constant  thickness  slices  (the 
thickness  may  vary  from  one  piece  to  the  other,  as  may  the 

straiahr  i7nth  ge*}erally  unrestricted  contours,  made  up  of 
Til„  nfli-J1!!?8  fnd  clrcular  arc  segments  in  any  arrangement. 
The  pieces  may  have  any  number  of  holes  (called  inner' 

agai"  of  arBitrary  shape.  Although  the  pieces 
a^i  * 1jdlulduaily  located"  any  distance  in  front  o^ 

of  the^creen.0^  SCreen'  they  must  be  ParaHel  to  the  plane 

gonalf^xia^yswma?5  ^ respeot  to  tw°  ’3  <°«ho- 

,„r.  One  system  has  its  origin  at  the  center  of  gravity 
with  ve  Pa6ve'  °r  assembly  or  Pieces,  being  investigated 

the  piece  UK  3X63  ^ 3 Plane  parallel  to  the  Plane  of 

hprifp^fn  designer  selects  the  location  of  the  other  system, 
hence-forth  referred  to  as  the  User  Reference  System/  The 
origin  of  this  system  may  be  anywhere  (X,Y,Z)  in  three- 

alsrpa?alieiP?ne^hat1the  US6r  desires  with  the  * and  Y axes 
thP fc  tho  plane  of  the  piece  (s ) . Additionally, 

its  7 tli«trenCe  System  may  be  angularly  displaced  about 

parts  or of  3L“SI  (A'  in  degrees>*  station  of  the 

parts  or  of  the  User  Reference  System  about  either  their  x 

on  Lf6S  wonJ.d  produce  artistically  foreshortened  pictures 
the  screen  which,  although  aesthetically  pleasing,  would 
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add  nothing  to  engineering  understanding  of  the  problem  - 
and  add  tremendously  to  programming  complexity.  Thus,  four 
displacement  vectors  (three  linear  and  one  angular)  are 
permitted  for  the  User  Reference  System  out  of  a possible  six. 

It  is  convenient  to  initially  confine  the  discussion  of 
the  PIPS'  algorithm  to  two-dimensional  (plane)  polygons  and 
later,  by  adding  thickness  to  the  polygons,  extend  the  ex- 
planation to  solid  bodies. 

The  outer  and  inner  contours  of  any  shape  input  to  PIPS 
may  be  described  as  contiguous  straight  line  and  circular 
arc  segments.  The  program  drops  perpendiculars  from  the  end 
points  of  the  line  segments  to  a base  line  below.  The  area 
enclosed  by  the  line  segment,  the  two  perpendiculars,  and 
the  base  line  is  automatically  partitioned  into  right 
triangles,  rectangles,  and  circular  segments.  The  section 
properties  of  these  well  known  BASIC  SHAPES  (fig. 6)  are 
in  the  literature. 

All  contour  lines  are  developed;  that  is,  drawn  from 
initial  point  to  end  point.  PIPS  establishes  the  X (hori- 
zontal) development  of  the  first  line  segment  as  what  is 
called  the  "ADD"  direction.  The  section  properties  of  the 
areas  under  all  line  segments  developed  in  that  direction 
are  added  together.  The  section  properties  of  areas  under 
all  line  segments  developed  in  the  opposite  or  "SUBTRACT" 
direction  are  subtracted  from  this  total.  The  summation  of 
all  this  addition  and  subtraction  produces  the  section 
properties  of  the  enclosed  area  of  the  plane  polygon.  This 
is  illustrated  in  the  figure  entitled  SUMMING  CONVENTION. 

(fig. 7),.  To  ensure  positive  values  for  area  and  section  prop- 
erties the  code  uses  the  absolute  values  of  the  summation  re- 
sults. (This  over  simplified  explanation  does  not,  of  course, 
hold  for  product  of  inertia  calculations.  More  detailed  com- 
puter logic  is  employed  to  produce  the  properly  signed  result) . 
Needless  to  say,  all  the  bookkeeping  of  adding,  subtracting, 
re-adding,  etc  of  section  properties  poses  no  strain  to  the 
user  as  it  is  transparently,  automatically  performed  for  him 
by  the  code. 

When  thickness  of  the  areas  is  introduced,  the  line 
segments  become  edges.  The  areas  under  the  line  segments 
are  extrapolated  from  triangles,  rectangles,  and  circular 
segments  to  wedges,  rectangular  parallelopipeds  (boxes) , 
and  cylindrical  segments.  The  mass  properties  of  these  basic 
solids  are  also  available  in  the  literature  or  may  be 
derived,  and  it  is  these  equations  that  are  coded  into  the 
PIPS  algorithm  (figs.  9-14). 
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illustrative  slice 


287 


FIGURE  8. 
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WEDGE  (B) 


P = MASS  DENSITY 
M = MASS  = Px  y z 


MASS  MOMENTS  OF  INERTIA  ABOUT  AXES  THROUGH  C.G. 
'xx-  (Y2  + 2z2) 

lyy  = ( X2  + 2Z2  ) 


*zz  = (x2  + y2> 

PRODUCTS  OF  INERTIA  ABOUT  PLANES  CONTAINING  C.G. 
'xy=  ~ P x2  y2  z = “■  M x y 

lyz=  0.0 

'xz=  00 


FIGURE  10. 


BOX 


P = MASS  DENSITY 
M = MASS  = px  yz 
CENTER  OF  GRAVITY:  x = x/2 

V = y/2 
z = z/2 

MOMENTS  OF  INERTIA  ABOUT  AXES  THROUGH  C.G. 

'xx=  (z2  + V2) 

lyy  = (X2  + Z2) 

'zz=  4M(x2  + y2) 

PRODUCTS  OF  INERTIA  WITH  RESPECT  TO  THE  PLANES  CONTAINING 
THE  C.G. 

*xy  = 0-0 

lyz  = 0.0 
'xz  = 0-0 


Y 


CYLINDER 


FIGURE  12. 


SEGMENTED 
CYLINDER  (A) 


ro 
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FIGURE  13. 


MASS  MOMENTS  OF  THE  INERTIA  ABOUT  AXES  THROUGH  C.G.  OF  THE  CYLINDER 
CONTAINING  THE  SEGMENT 

Ixx  = --^3  ( a - COS  a SIN  a ) + - r-z  ( a - SIN  a COS  a - 2 13  SiN3a  COS  a ) 
p r2  z3  p r4  z 

!yy=  — ^ { a - COSa  SlNa  ) + — - ( a- SlNa  COSa  + 2 SIN3a  COSa  ) 

\zz  = — y -■  ( a - SIN  a COS  a + 2/3SlN3a  COS  a ) 


PRODUCTS  OF  INERTIA  ABOUT  PLANES  THROUGH  C.G.  OF  THE  CYLINDER 
THAT  CONTAINS  THE  SEGMENT 


*xy  *xz  ~ lyi  3 
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<ntf*INC  OIOIS  A*  CtMMDIMTtt 
OF  A «.  m THK  X-Mtt 


of  wtmict  uu 


The  solid  arrows  are  the  X and 
Y axes  of  the  User  Reference  Sys- 
tem. The  two  pair  of  dashed  vec- 
tors are  the  principal  axes  drawn 
through  the  CG  of  the  piece  and 
through  the  User  Reference  Origin. 
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or  ftfrtWO  AXIS 


TMcti m cnost  a«  coordinates 

Of  A AT.  » ?*C  X AXIS 


to  the  piece  may  be  immediately 
seen.  Here  the  User  Reference 
Origin  is  "attached"  with  the 
light  pen  and(  towed  (translated) 
to  another  location.  All  prop- 
erties are  recalculated. 
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FIGURE  21.  TRANSLATION  OF  USER  REFERENCE  ORIGIN 


TRACKING  CROSS  AS  COONOINAItS 
OF  KFttKNCt  AXIS 


tracking  cross  as  coordinates 

OF  A NT.  (w  THE  X-AXIS 


x4r' 


Here,  the  User  Reference  Sys- 
tem is  rotated  about  the  Z axis 
to  a new  orientation.  The  coor- 
dinate system  can  be  relocated 
(and  reorientated)  with  the 
light  pen  or  via  the  keyboard. 
Again,  the  displayed  results  are 
newly  calculated. 
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FIGURE  22.  ROTATION  OF  USER  REFERENCE  SYSTEM 
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The  User  Reference  System  can 
even  be  repositioned  off  the 
screen  — out  of  the  building,  if 
desired.  pips  automatically  re- 
scales the  picture  to  fit.  within 
the  boundaries  of  the  CRT. 
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FIGURE  23.  AUTOMATIC  SCALING 
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FIGURE  27.  CLOSE-UP  OF  FRONT  VIEW 


It  is  possible  at  any  time  to  return  to  the  electronic 
drafting  board  portion  of  PIPS  to  redesign  a piece  or  create 
an  entirely  new  item.  As  an  example,  the  contour  of  the 
recently  examined  piece  is  used  as  the  inner  contour  of  an 
irregular  hole  in  a newly  designed  circular  disk.  The  four 
holes  in  the  original  piece  are  now  small  circular  disks  of 
different  diameters  located  within  the  irregularly  shaped 
inner  contour.  The  new  configuration  is  now  a cluster  of 
five  solids  with  all  of  the  PIPS  graphics  and  calculation 
capabilities  (change  of  contour,  material  and  reference 
axes,  automatic  scaling,  etc..)  available  - a cluster  can  be 
handled  as  if  it  were  a single  entity. 

Finally,  a file  of  any  display  of  a piece  or  assembly 
of  pieces  may  be  catalogued  for  use  by  a plotter  program  to 
produce  a hard  copy  of  what  appears  on  the  screen.  (Slide 
and  Poloroid  cameras  also  provide  reproductions  of  the 
screen  contents  but  their  proper  utilization  require  addi- 
tional skills) . 
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FIGURE  28.  NEW  DESIGN 
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FIGURE  32 


Hard  copy  of  one  page  of  the 
library  of  slices.  Selections 
are  made  by  locating  the  CRT's 
cross-hairs  over  any  portion  of 
the  item  desired. 
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EQUATION  OF  MOTION  OF  FIRST  LEAF 
OF  SAFING  AND  ARMING  MECHANISM 

OF  M509  FUZE 


*P1  0 1 


■ • 

rc1  S COS(Ot+  0r)  + PIVOT  FRICTION  TORQUE 

+ SPRING  TORQUE  + FRICTION  TORQUE  DUE  TO  SPRING 

+ TORQUE  DUE  TO  FORCE  FROM  LEAF  2 EXERTED 
ON  LEAF  1 (INCLUDING  FRICTION) 


FIGURE  35. 


(2)  Proposed  Safinq  and  Arming  Mechanism  of  TOW  Missile 

(a)  Description  of  System 

This  mechanism  is  similar  to  the  previous  device 
with  the  exception  that  only  two  leaves  are  currently 
used. 

(b)  Problem 


A study  was  undertaken  to  evaluate  the  possibility 
of  replacing  the  present  system  with  a single  leaf.  A 
drawing  of  one  of  the  three  proposed  leaves  is  depicted 
in  Fig.  36. 

(c)  Application  of  PIPS 

A relationship  was  derived  expressing  the  "g"  level 
required  to  initiate  motion  of  the  leaf  as  a function 
of  the  leaf  geometry  and  mass  properties,  leaf  and  rotor 
spring  torques,  forces  due  to  leaf  and  rotor  interaction, 
and  friction.  The  expression  for  this  g-level  is  given 
in  Fig.  37.  The  nomenclature  for  this  equation  is  as 
follows: 


Fr  * force  exerted  by  the  rotor  on  the  leaf 

Mr, MS  = spring  torques  on  the  rotor  and  leaf, 
respectively 

w = weight  of  leaf  (determined  by  PIPS) 

X3  = location  of  c.g.  (determined  by  PIPS) 

jX  = coefficient  of  friction 

The  other  parameters  are  geometric  constants  which  are 
defined  in  Fig.  38.  (Note:  x(  =0  for  this  contour.) 

The  PIPS  program  was  utilized  to  locate  the  position 
of  the  c.g.  for  each  of  the  three  candidate  designs. 

It  is  estimated  that  approximately  one  week  of  engineer- 
ing time  was  saved  by  applying  PIPS. 

With  the  help  of  PIPS,  the  mathematical  simulation 
was  able  to  demonstrate  the  feasibility  of  the  single 
leaf  design. 
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SINGLE  LEAF  CONFIGURATION  FOR 
SAFING  AND  ARMING  MECHANISM 

FOR  TOW  MISSILE 


FIGURE  36. 
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SAFING  AND  ARMING 
FOR  TOW  MIS 
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FIGURE  3 


(3)  Timing  Mechanism  of  M577  Fuze 


(a)  Description  of  System 

Two  views  of  the  timer  of  the  M577  fuze  are 
shown  in  Figs.  39a  and  39b.  Fig.  40  is  a schematic 
diagram  of  the  functioning  sequence  of  this  mechanism. 

In  the  safe  position,  the  setback  pin,  which  is  spring 
loaded,  blocks  the  path  of  motion  of  the  spin  detent, 
which  is  also  spring  loaded  (spring  not  shown  in  Fig. 40). 
The  spin  detent  engages  the  balance  wheel  of  the  timer, 
thus  preventing  it  from  operating.  Under  the  influence 
of  the  acceleration  of  the  projectile  in  the  gun  tube, 
the  setback  pin  retracts  against  the  setback  spring, 
releasing  the  spin  detent.  As  the  projectile  progresses 
through  the  gun  bore,  rotation  causes  the  spin  detent 
to  move  radially  outward  against  its  spring  until  stopped 
by  the  fuze  body.  This  movement  frees  the  balance  wheel 
and  sets  the  timer  mechanism  into  motion,  eventually 
arming  the  fuze. 

(b)  Problem 


An  unacceptable  dud  rate  was  experienced  during  lot 
acceptance  testing  of  the  fuze.  It  was  hypothesized  that 
the  setback  pin  had  returned  to  its  original  position  be- 
fore the  spin  detent  had  released  the  balance  wheel,  which, 
of  course,  would  result  in  a dud.  Such  an  event  is  possible 
since,  due  to  friction  torques  induced  by  the  projectile 
acceleration,  the  spin  detent  does  not  move  for  a con- 
siderable portion  of  the  acceleration  pulse.  It  is  only 
when  the  acceleration  level  has  decayed  sufficiently  that 
the  spin  detent  can  move  and  disengage  the  balance  wheel. 

By  that  time,  the  force  exerted  on  the  setback  pin  by 
the  setback  spring  may  be  greater  than  that  due  to  the 
acceleration.  This  makes  it  possible  for  the  setback  pin 
to  return  and  obstruct  the  path  of  motion  of  the  spin  de- 
tent. 
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FUNCTIONING  SEQUENCE  OF  M577  FUZE  TIMING  MECHANISM 


DETENT 


POSITION  1 
BALANCE  IS  LOCKED 

0 = 0 


(A) 

SAFE  POSITION 


POSITION  1 

REST  POSITION  AGAINST  DETENT 


FIGURE  40 


(c)  PIPS  Application 

In  order  to  verify  or  disprove  this  hypothesis , a 
mathematical  model  of  the  behavior  of  this  system  was 
developed.  The  differential  equation  given  in  Fig.  41 
was  derived  to  describe  the  motion  of  the  spin  detent 
(see  Fig.  42).  In  that  equation: 

I - moment  of  inertia  of  detent  (determined 
by  PIPS) 

Ifr  = additional  effective  moment  of  inertia 
due  to  frictional  effects 

0.0,0  = angular  displacement,  velocity  and 

acceleration  of  detent,  respectively 

= mass  of  detent  (determined  by  PIPS  and 
verified  by  weighing) 

L ■ distance  from  projectile  spin  axis  to 
pivot  of  spin  detent 

rc  = location  of  c.g.  of  detent  with  respect 
to  pivot  (determined  by  PIPS) 

CUjCd  = angular  velocity  and  acceleration  of 
projectile 

Cl  = initial  angular  orientation  of  spin 
detent 

The  use  of  the  PIPS  program  to  compute  the  mass, 
moment  of  inertia  and  c.g.  location  of  the  detent  resulted 
in  a savings  of  approximately  three  days  of  engineering 
time.  (To  verify  the  validity  of  the  calculated  values, 
the  spin  detent  was  weighed,  and  the  measured  mass 
differed  only  negligibly  from  the  calculated  mass.) 

The  analysis  showed  that  the  motion  of  the  spin  detent 
is  very  sensitive  to  friction;  a dud  is  quite  likely 
to  occur  whenever  the  coefficient  of  friction  is  of  the 
order  of  .35.  The  dud  problem  was  eventually  alleviated 
when  the  setback  pin/setback  spring  system  was  modified. 
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E.  Development  Plans  and  Conclusions 


There  are  other  programs,  batch  and  graphics,  that 
perform  similar  calculations  on  different  shapes!  solids 
of  revolution  and  asymmetrical  systems  of  bodies  (with 
major  axes  at  skew  angles) . It  is  intended  to  incorporate 
the  special  features  and  strengths  of  these  other  programs 
into  the  next  version  of  the  PIPS  code  along  with  the 
coding  to  use  the  tablet  as  a digitizing  device  to  permit 
rapid  input  of  contours  directly  from  drawings  or  lofting 
templates. 

As  a final  reflection  - the  development  of  batch  pro- 
cessing computer  programs  to  solve  problems  of  this  type 
has  proved  inadequate  to  satisfy  the  analysis  needs  of 
the  designers  who  must  use  them.  The  perceptual  augmenta- 
tion that  graphics  provides  has  proved  to  be  an  essential 
ingredient  in  the  design  - analysis  process,  and  to  insure 
maximum  usefullness,  a substantial  amount  of  human  engineer 
ing  must  go  into  the  development  of  the  graphics  procedures 
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DEVELOPMENT  PLANS 


COMBINE  CAPABILITIES: 

• PIPS*  (PLANAR  SOLIDS) 

• PROMSrPHASOR? WEIGHT  (AXISYMMETRICAL  SHAPES) 

• SOS,  MOMENTS  (ASYMMETRICAL  SYSTEM  OF  BODIES) 

* GRAPHICS  PROGRAMS 


FIGURE  43. 


TESTING  ALGORITHMS  FOR  A MINI-COMPUTER  ON  A MAXI 


F.  D.  Crary 

The  Mathematics  Research  Center 
The  University  of  Wisconsin-Madison 


ABSTRACT.  A common  problem  in  the  development  or  testing  of  algorithms 
to  be  run  on  minicomputers  is  the  lack  of  access  to  the  eventual  target 
machine.  In  such  cases,  development  is  done  on  other  equipment.  Occasionally 
the  success  (or  failure)  of  the  development  effort  may  directly  influence  the 
choice  of  minicomputer  to  be  used  in  the  application. 

An  obvious  approach  to  this  problem  is  to  simulate  the  target  minicomputer 
on  available  large  scale  equipment.  This  approach  poses  certain  difficulties. 
Even  if  a package  of  subroutines  to  simulate  the  minicomputer  arithmetic  is 
accessible  from  a high-level  language,  the  programming  problems  are  difficult. 
Since  calling  upon  such  subroutines  usually  amounts  to  coding  in  assembly 
language  disguised  in  the  syntax  of  a high-level  language,  nearly  all  of  the 
problems  of  assembly  language  coding  face  the  programmer:  the  programming 
time  is  excessive,  the  programs  are  difficult  to  comprehend  and  correct,  etc. 

A solution  to  these  difficulties  is  the  use  of  an  appropriate  preprocessor. 
The  input  to  the  preprocessor  is  the  desired  program  expressed  in  a high-level 
programming  language.  The  output  of  the  preprocessor  is  the  same  program 
expressed  in  terms  of  references  to  the  subroutines  simulating  the  minicomputer. 
Thus  the  preprocessor  has  done  the  "dirty  work"  of  preparing  the  disguised 
assembly  language  version  of  the  program. 

A desirable  capability  of  such  a preprocessor  is  to  be  able  to  accept  a 
description  of  the  supporting  subroutine  package  and  produce  a program  accord- 
ing to  that  description.  This  can  be  useful  if  a number  of  different  simulations 
must  be  made — neither  the  input  program  nor  the  preprocessor  need  be  changed, 
only  the  description. 

Such  a preprocessor  to  extend  Fortran  (called  AUGMENT)  is  available 
from  the  Mathematics  Research  Center.  The  input  language  is  Fortran  extended 
by  user-defined  data  types,  operators,  and  functions.  AUGMENT'S  output  is 
an  ANSI  Standard  Fortran  program  with  nonstandard  operations  replaced  by 
calls  to  appropriate  subroutines.  The  language  extension  capabilities  of 
AUGMENT  make  it  suitable  for  an  application  needing  nonstandard  arithmetics. 
Some  other  applications  are  mentioned  in  the  text. 


Sponsored  by  the  United  States  Army  under  Contract  No.  DAAG29-7  5-C-0024. 
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1.  INTRODUCTION.  The  initial  problem  that  we  address  is  the  develop- 
ment of  software  for  minicomputer  applications  on  large  scale  computers.  This 
development  may  be  required  for  a number  of  reasons: 

Algorithms  may  be  destined  for  a number  of  different 
computers,  more  than  can  conveniently  be 
made  available  to  the  developer. 

Algorithms  may  be  of  varying  sophistication  for 
different  applications. 

The  requirements  of  the  algorithm  may  determine  the 
minicomputer  to  be  used  in  the  application; 
that  is,  the  algorithm  development  must  precede 
acquisition  of  the  minicomputer. 

The  file  storage  and  editing  capabilities  of  the  large 
system  may  offer  an  attractive  development 
environment. 

Any  number  of  these  reasons  (and  possibly  others)  may  combine  to  make 
software  development  and/or  maintenance  on  large  scale  equipment  desirable 
or  necessary. 

An  obvious  means  of  performing  such  development  begins  with  the 
development  or  acquisition  of  a package  of  subroutines  that  allows  the 
simulation  of  the  essential  aspects  of  the  arithmetic  on  the  minicomputer 
to  be  used  for  the  application.  The  proposed  algorithms  are  then  tested 
in  the  large  machine  by  coding  references  to  the  routines  in  the  simulation 
package. 

This  approach  to  development  has  several  difficulties.  First,  consider 
the  kind  of  code  that  must  be  written.  In  Fig.  l(a),  we  have  a well-known 
algebraic  expression.  We  assume  that  a straightforward  evaluation  of  this 
expression  is  desired  (the  proper  choice  of  numerical  method  is  a very 
important  subject,  but  we  do  not  concern  ourselves  with  it  in  this  paper). 

If  we  were  to  evaluate  this  expression  by  references  to  a collection  of  three 
argument  subroutines,  we  would  obtain  code  of  the  form  shown  in  Fig.  1(b). 

If  the  simulating  routines  supported  a simulated  accumulator,  Fig.  i(c) 
could  result.  Finally,  simulating  routines  implemented  as  functions  could 
yield  Fig.  1(d). 
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(a)  original  expression 

CALL  MUL  (B,  B,  TMP(l)) 

CALL  MUL  (FOUR,  A,  TMP(2) ) 

CALL  MUL  (TMP(2),  C,  TMP(2)) 

CALL  SUB  (TMP(l),  TMP(2),  TMP(2)) 

CALL  RT2  (TMP(2),  TMP(2)) 

CALL  SUB  (TMP(2) , B,  TMP(2)) 

CALL  ADD  (A,  A,  TMP(l)) 

CALL  DIV  (TMP(2) , TMP(l),  X) 

(b)  subroutine  package 

CALL  LOAD  (FOUR) 

CALL  MUL  (A) 

CALL  MUL  (C) 

CALL  STORE  (TMP(l)) 

CALL  LOAD  (B) 

CALL  MUL  (B) 

CALL  SUB  (TMP(U) 

CALL  RT2 
CALL  SUB  (B) 

CALL  DIV  (TWO) 

CALL  DIV  (A) 

CALL  STORE  (X) 

(c)  simulated  accumulator 

X = QUOT  (DIFF  (RT2  (DIFF  (PROD  (B,  B),  PROD 
(FOUR,  PROD  (A,  C)))),  B),  SUM  (A,  A)) 

(d)  functions 


Figure  1 . 

Code  generated  by  calling  on 
simulating  packages 
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The  point  of  Fig.  1 is  that  the  resulting  code  to  be  used  to  test  the 
algorithm  is  essentially  assembly  language  code.  The  use  of  a high-level 
language  and  compiler  serve  only  to  disguise  that  fact,  and  not  very  well 
since  more  code  might  be  written  than  would  be  required  by  a true  assembly 
language  program.  Programs  written  in  this  form  suffer  from  almost  all  the 
faults  of  assembly  language  programs: 

programming  time  and  effort  are  excessive 
programs  are  difficult  to  comprehend  and  maintain 
debugging  is  more  difficult 

algorithm  testing  is  more  difficult  because  it  is  harder 
to  make  changes 

An  additional  difficulty  can  occur  if  more  than  one  machine  is  to  be 
simulated  in  this  fashion.  If  the  simulating  packages  for  the  various 
machines  are  obtained  from  different  sources,  they  may  have  different 
calling  conventions.  Thus  several  different  versions  of  the  same  program 
may  need  to  be  written,  debugged,  and  maintained  during  the  development 
process. 

2.  AUGMENT.  The  Mathematics  Research  Center  has  developed  the 
AUGMENT  precompiler  to  solve  problems  of  the  sort  discussed  above. 
AUGMENT  is  a flexible  preprocessor  that  extends  the  Fortran  language  to 
include  nonstandard  data  types,  operators,  and  functions.  Its  operation  is 
summarized  in  Figure  2.  The  input  to  AUGMENT  is  a source  program  written 
in  the  extension  to  Fortran,  accompanied  by  a Description  Deck. 

The  Description  Deck  contains  information  about  the  extension  to  the 
Fortran  language  and  how  the  extension  is  implemented  by  the  simulating 
package.  A portion  of  a Description  Deck  with  an  explanation  of  its  contents 
is  contained  in  the  Appendix. 

To  test  an  algorithm  for  a minicomputer  using  AUGMENT,  one  would 
need  a simulating  package  that  supported  the  minicomputer  data  types  to  be 
used  in  the  algorithm  (INTEGER,  REAL,  and  perhaps  DOUBLE  PRECISION), 
and  a Description  Deck  for  the  package.  Then  the  algorithm  is  written  in 
Fortran  with  the  various  quantities  declared  as  nonstandard  data  types  as 
specified  by  the  Description  Deck.  A pass  through  AUGMENT  generates  a 
Fortran  program  which  makes  calls  on  the  simulating  package  (see  Fig.  3). 
From  that  point,  compilation,  linking/loading,  and  testing  proceed  as  usual. 

AUGMENT  eliminates  the  need  to  program  in  disguised  assembly 
language  by  creating  that  form  automatically.  AUGMENT  also  eliminates  the 
need  for  several  versions  of  the  same  algorithm  if  more  than  one  machine  is 
to  be  simulated.  In  this  case,  only  the  Description  Deck  need  be  changed  to 
change  simulated  machine  and  adapt  to  different  calling  conventions. 


336 


Figure  2. 

Summary  of  AUGMENT  use 
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3.  OTHER  APPLICATIONS.  AUGMENT  can  be  applied  in  any  application 
where  nonstandard  data  types  or  operations  are  needed.  Some  of  these 
applications  are: 

-instruction  counts.  To  obtain  an  estimate  of  the  amount  of  work 
performed  by  a subroutine,  one  can  prepare  a simulating 
package  which  performs  the  usual  machine  arithmetic  opera- 
tions with  the  side  effect  of  counting  the  number  of  opera- 
tions that  are  performed.  This  may,  of  course,  be  combined 
with  a minicomputer  simulation. 

-multiple  precision  arithmetic.  On  those  occasions  when  one  requires 
greater  accuracy  than  is  provided  by  the  machine,  a software 
multiple  precision  may  be  easily  employed  with  the  aid  of 
AUGMENT. 

-interval  arithmetic.  Interval  arithmetic  is  a tool  for  obtaining  valid 
results  in  spite  of  the  inherent  inaccuracies  of  machine 
computation.  The  method  computes  with  closed  intervals 
rather  than  single  numbers.  See  [ 3]  for  more  information. 

-Taylor  series.  In  a package  under  development,  a truncated  Taylor 
series  is  maintained  for  each  variable.  When  an  operation 
is  performed,  the  Taylor  series  for  the  result  is  computed. 


AUGMENT  input: 


X = (-B  + SQRT  (B*B  - 4.*A*C))  / (2.*A) 
AUGMENT  output: 


CALL  NEG  (B,  TMP(l)) 

CALL  MUL  (B,  B,  TMP(2)) 

CALL  CNV  (4.,  TMP(3) ) 

CALL  MUL  (TMP(3),  A,  TMP(3)) 

CALL  MUL  (TMP(3) , C,  TMP(3)) 

CALL  SUB  (TMP(2) , TMP(3),  TMP(3)) 
CALL  RT2  (TMP(3),  TMP(3)) 

CALL  ADD  (TMP(l),  TMP(3),  TMP(3)) 
CALL  CNV  (2.,  TMP(l)) 

CALL  MUL  (TMP(l),  A,  TMP(l)) 

CALL  DIV  (TMP(3),  TMP(l),  X) 


Figure  3. 


Sample  of  AUGMENT  translation 
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4 . AVAI L ABI LI T Y . AUGMENT  is  currently  (March  197  6)  available 
for  the  following  systems: 

UNI  VAC  1100 
IBM  370 
CDC  7 600 
Honeywell  635 

Copies  of  AUGMENT  have  been  supplied  to  persons  intending  to  implement 
it  on  the  following  systems: 

MULTICS 
DEC  PDP-lO 

Portability  of  AUGMENT  has  been  found  to  be  very  good  (in  one  week, 
both  the  IBM  and  CDC  versions  were  brought  up).  The  precompiler  is 
written  in  ANSI  Standard  Fortran  except  for 

(1)  eight  machine  dependent  primitive  routines  which  require  about 
150  lines  (total)  to  implement,  and 

(2)  some  minor  oversights  which  are  documented  in  [2]. 

5.  CONCLUSION.  The  AUGMENT  precompiler  is  a tool  which  simplifies 
the  programming  process  when  nonstandard  arithmetics  and  data  types  are 
required.  The  applications  for  AUGMENT  include  simulation  of  minicomputers, 
word  length  sensitivity  analysis,  multiple  precision  arithmetic,  operation 
counting,  and  interval  arithmetic. 

AUGMENT  will  not  cure  all  world's  programming  problems,  but  can 
be  of  great  assistance  in  dealing  with  some  problems.  One  user  of  AUGMENT 
put  it  this  way:  "AUGMENT  isn't  a program  that  you  need  all  the  time — but 
when  you  need  it,  you  really  need  it.  " 
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APPENDIX 


This  Appendix  presents  a portion  of  a Description  Deck  for  AUGMENT 
with  a brief  explanation  of  its  contents.  A portion  of  a Description  Deck 
for  a double  precision  complex  data  type  is  presented  in  Figure  4.  The  line 
numbers  in  the  Figure  are  added  for  easy  reference  from  the  text.  The 
explanation  below  is  by  line  number. 


Line  1 . The  first  line  of  the  description  of  a data  type  specifies  the 
name  of  the  type.  A program  translated  under  this  description  may  use  the 
data  type  name  DBLCOMPLEX  in  the  same  way  that  INTEGER,  REAL,  etc.  are 
used  in  a standard  Fortran  program.  In  the  lines  following,  the  character 
"$"  is  an  abbreviation  for  "DBLCOMPLEX".  This  abbreviation  is  built  into 
AUGMENT  to  reduce  the  size  of  the  Description  Deck. 


1.  *DESCRIBE  DBLCOMPLEX 

2.  DECLARE  DOUBLE  PRECISION  (2),  KIND  SAFE  SUBROUTINE, 

PREFIX  DPC 

3.  OPERATOR  - (NEG,  UNARY,  PREVIOUS,  $) 

4.  OPERATOR  + (,  NULL  UNARY,  PREVIOUS,  $) 

5.  OPERATOR  + (ADD,  BINARY  1,  PREVIOUS,  $),  - (SUB), 

* (MUL(SUBROUTINE) ) , / (DIV(SUB)) 

6.  OPERATOR  .EQ.  (EQ,  BINARY  2,  PREVIOUS,  $,  LOGICAL), 

.NE.  (NE) 

7.  FUNCTION  CLOG  (LN,  ($),$),  LN  () 

8.  CONVERSION  CTDC  (CFI,  INTEGER,  $,  UPWARD),  CTDC  (CFR,  REAL) 


Figure  4. 


Portion  of  Description  Deck  for 
Double  Precision  Complex  Arithmetic 
(Line  numbers  are  not  part  of  the 
Description  Deck.) 
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Line  2.  The  second  line  gives  the  general  information  about  the  data 
type  and  its  supporting  package  of  subprograms.  The  first  clause  (DECLARE 
DOUBLE  PRECISION  (2))  specifies  the  way  in  variables  of  type  DBLCOMPLEX 
are  to  be  represented  in  the  translated  program.  A declaration  of  the  form 

DBLCOMPLEX  A,  B(  1 0) 

would  be  translated  to  the  form 

DOUBLE  PRECISION  A(2),  B(2, 1 0) 

The  second  clause  indicates  the  nature  of  the  supporting  routines.  Three 
choices  are  available:  function,  subroutine  (three  argument),  and  simulated 
accumulator.  The  clause  SAFE  SUBROUTINE  specifies  that  the  routines  are 
three  argument  subroutines  and  that  it  is  safe  to  allow  the  output  argument 
to  be  one  of  the  operand  arguments.  This  specification  may  be  overridden 
for  specific  routines  (see  Line  5).  The  last  clause  (PREFIX  DPC)  specifies 
the  manner  in  which  the  routines  are  named.  Unless  overridden,  all  routine 
names  will  begin  with  the  letters  "DPC"  as  will  the  names  of  all  temporary 
locations  generated  by  AUGMENT. 

Line  3.  Beginning  with  Line  3,  we  describe  the  operators  that  may 
have  DBLCOMPLEX  operands.  Line  3 describes  the  unary  minus  (negation) 
operator.  It  is  implemented  by  the  subroutine  DPCNEG  (the  "DPC"  is  not 
written  since  it  is  the  prefix — AUGMENT  adds  it).  The  remaining  information 
in  the  line  specifies  that  the  operator  is  unary,  that  its  position  in  the 
operator  hierarchy  is  previously  defined  (the  hierarchy  position  is  attached 
to  the  symbol  used  in  the  source  program),  and  that  this  description  of  the 
operator  is  for  operands  of  type  DBLCOMPLEX  (the  result  is  also  of  type 
DBLCOMPLEX  since  unary  operators  do  not  change  type). 

Line  4 describes  the  unary  + operator.  Since  the  description  contains 
the  phrase  "NULL  UNARY",  AUGMENT  will  discard  all  unary  + operators  with 
an  operand  of  type  DBLCOMPLEX.  Since  it  is  to  be  discarded,  no  routine 
name  need  be  specified. 

Line  5.  Next  come  the  arithmetic  operators.  The  notation  "BINARY  1" 
specifies  that  the  operator  being  described  is  binary  (has  two  operands)  and 
that  both  operands  and  the  result  for  this  description  have  the  same  type 
(DBLCOMPLEX).  This  line  also  indicates  the  "drop  out"  rules  allowed  by 
AUGMENT.  The  subtraction  operator  has  exactly  the  same  description  as  the 
addition  operator  except  for  subroutine  name,  hence  only  the  subroutine 
name  need  be  specified.  For  the  multiplication  and  division  operators,  the 
routines  are  such  that  it  is  not  safe  for  the  result  argument  to  be  one  of  the 
operand  arguments.  Hence  the  "SAFE  SUBROUTINE"  specification  of  Line  2 
is  overridden  by  the  specification  "SUBROUTINE"  which  may  be  abbreviated 
"SUB". 
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Line  6.  The  relational  operators  . EQ.  and  .NE.  are  also  meaningful 
for  complex  types.  The  "BINARY  2"  specification  means  that  the  description 
will  contain  two  data  type  names:  the  first  is  the  type  of  the  operands;  and 
the  second,  the  type  of  the  result.  Since  none  of  the  other  relational 
operators  are  defined  for  DBLCOMPLEX  operands,  their  use  will  cause 
AUGMENT  to  diagnose  an  error. 

Line  7 . Next  we  describe  two  of  the  functions  which  may  operate  on 
DBLCOMPLEX  operands.  Functions  for  AUGMENT  are  generic.  The 
logarithm  function  is  described  as  being  implemented  by  the  routine  DPCLN 
when  it  appears  with  the  argument  list  "($)",  (that  is,  when  it  appears  with 
a single  argument  of  type  DBLCOMPLEX),  and  yields  a result  of  type 
DBLCOMPLEX.  This  function  may  also  be  called  by  the  name  LN  with  the 
same  description. 

Line  8 describes  two  conversion  functions.  Conversion  functions  are 
applied  automatically  in  certain  cases  of  mixed  mode  operands  or  may  be 
coded  in  the  program.  The  function  CTDC  (Convert  To  Dbl Complex)  is 
defined  for  INTEGER  and  REAL  arguments  with  DBLCOMPLEX  results.  In 
each  case,  the  conversion  is  "upward"  meaning  that  DBLCOMPLEX  is  the 
preferred  type  if  automatic  conversion  is  applied.  That  is,  the  expression 
"I+D"  is  equivalent  to  "CTDC(I)+D"  if  I is  INTEGER  and  D is  DBLCOMPLEX. 
"Downward"  conversion  functions  are  applied  automatically  only  across  the 
replacement  operator. 

Other  features  and  options  are  available  which  are  not  illustrated 
by  this  example.  The  user  documentation  [ l]  describes  these  cases. 
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USE  OF  A MIN I -COMPUTER  FOR  ON-LINE  REAL-TIME 
PROCESSING  OF  MASS  SPECTRAL  DATA  FROM  MULTIPLE  MASS  SPECTROMETERS 

D.H,  Robertson  and  C,  Merritt,  Jr, 

U,S,  Army  Natick  Research  and  Development  Command 
Natick,  Massachusetts  01760 


ABSTRACT 

One  of  the  greatest  challenges  .in  the  area  of  automated  data  processing  is 
the  output  from  a mass  spectrometer;  combined  gas  chroma tography/mass 
spectrometry  provides  still  more  stringent  demands  on  the  data  processing 
system.  In  order  to  realize  the  goal  of  on-line  real-time  acquisition  and 
reduction  of  mass  spectrometry  data  from  multiple  instruments,  special 
means  were  developed  for  this  purpose.  As  an  adjunct  to  acquisition  and 
reduction  processes,  there  exists  the  problem  of  identification  of  the 
sample  by  means  of  library  search  routines  which  must  be  rapid  to  be 
effective . 


Several  mathematical  manipulations  were  conceived,  and  evaluated  on  the  raw 
data  to  facilitate  identification;  library  file  structure  was  investigated 


as  a primary  means  effecting  this.  These  approaches  to  identification  and 
the  library  file  structure  are  discussed.  Typical  of  the  quest' ons  which 


must  be  answered  in  the  latter  case  are: 
library  search  algorithm  and  the  optimum 


That  are  the  features  of  an  optimum 
library  size;  and  :is  it  reasonable 


to  combine  pattern  classification  techniques  with  a library  search?  Also 


the  computer  system,  a DTC  15/76,  legitimately  in  the  mini- computer  class, 
is  described;  especially  as  it.  relates  to  the  automation  of  an  analytical 


chemistry  laboratory. 
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The  primary  function  of  the  computer  system  in  the  Analytical  Chemistry 
Laboratories  at  the  Natick  r&D  Command,  has  been  to  automate  the  instrumen- 
tation used  to  perform  chemical  analyses.  The  most  challenging  aspects 
of  automation  are  presented  by  on-line  real-time  acquisition,  reduction 
and  interpretation  of  mass  spectrometer  output.  In  the  area  of  interpre- 
tation, several  mathematical  data  treatments,  in  succession,  have  been 
developed  which  have  been  used  in  sequence  as  the  size  of  the  data  base 
has  increased.  The  hardware  and  software  specifications  of  our  system 
are  described  herein  followed  by  a review  of  the  mathematical  treatments 
and  presentation  of  some  applications  of  the  overall  system. 

The  basic  system  is  a PDP15/76,  manufactured  by  Digital  Equipment 
Corporation.  It  is  equipped  with  48K  of  core  memory  in  the  central 
processor  configuration  and  2 PDP11  memories  respectively,  controlling 
the  UNIBUS  access  to  the  15  memory  and  the  I/O  operations  for  the  GT40,  a 
graphic  display  system.  The  15  memory  directly  controls  a fixed  head  disc 
with  a 250,000  word  capacity  i.e,  2^8  (262,144);  a cartridge  disc  system 
operates  through  the  peripheral  processor,  which  allows  the  15  memory  to 
access  the  11  memory  associated  with  the  unibus. 

Figure  1 depicts  the  basic  units  of  our  system  as  well  as  some  of  the 
experiments  which  are  interfaced  with  the  system.  The  slave  computer  is 
a HP  2116B  which  was  the  original  venture  into  computer  technology  in  the 
Analytical  Chemistry  Laboratories.  Simply  because  the  acquisition  and 
reduction  routines  were  fully  developed  for  this  disc-oriented  system, 
it  has  been  kept  in  the  link,  handling  output  from  a medium  resolution 
mass  spectrometer,  the  output  of  which  is  digitized  directly  with  a PAD 
(pulse  amplifier  discriminator). 
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The  TOF  Mass  Spectrometer  is  equipped  with  a CSI  260  high  speed 
digitizer  as  a front  end  prior  to  direct  data  transmission  to  the  15/76 
system.  The  high  resolution  mass  spectrometer  is  simularly  front-ended 
with  a high  speed  digitizer  unit  custom  built  for  the  Laboratories  by 
Adage. 

It  is  possible  to  effect  compound  identification  from  first  principles, 
an  approach  whereby  the  structure  of  each  unknown  is  assigned  as  a function 
of  characteristic  mass  peaks  in  a spectrum  which  refer  directly  to  the 
presence  of  specific  bonding  and/or  chemical  functionality.  As  such, 
a library  is  not  necessary. 

In  most  cases,  however,  the  identification  of  an  unknown  compound 
from  its  mass  spectrum  by  automatic  data  processing  assumes  the  existence 
of  a file  of  data  for  a large  number  of  known  compounds  and  the  capability 
for  searching  that  file  in  a manner  which  will  provide  component  identi- 
fication. The  conventional  approach  uses  tables  of  mass  vs,  intensity 
values  that  constitute  digitized  mass  spectra.  By  comparing  the  unknown 
to  each  known  spectrum  in  the  library,  it  is  possible  to  achieve  identifi- 
cation from  the  best  match  of  the  unknown  to  a known  spectrum  in  the 
library  file. 

Because  of  the  large  number  of  spectra  to  be  searched  in  a typical 
application,  that  of  combined  gc/ms  operation  for  instance,  the  computer 
configuration  required  for  identification  of  unknown  based  on  use  of 
all  their  mass  and  intensity  data  i.e.,  the  complete  spectrum,  is  normally 
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quite  large  and  counter  to  the  best  interests  of  efficient  on-line 
real-time  handling  of  mass  spectral  data.  In  particular,  there  is  a 
need  in  the  average  analytical  laboratory  for  searches  on  small, 
relatively  inexpensive  computers  which  remain  inexpensive  only  if  their 
peripheral  units  are  confined  to  modest  size  and  capability. 

Historically,  three  methods  were  developed  for  encoding  spectra  as 
compressed  data: 

1.  Calculation  of  entropy  function 

2.  Calculation  of  divergence 

3.  Selected  binary  encoding 

The  first  two  approaches  to  the  classification  of  mass  spectral 
data,  namely  the  calculation  of  the  Khinchine  entropy  function  and  of  the 
divergence  function,  are  derived  from  set  theory  and  are  based  on  the 
statistical  distribution  of  peaks  in  a mass  spectrum.  These  as  well  as 
the  selected  binary  encoding  which  is  described  below,  have  in  common  the 
reduction  of  the  mass  spectrum  to  a single-valued  number  which  is  diagnostic 
within  some  range  for  the  compound  with  which  it  is  associated.  ( Fig.  2.) 

The  entropy  function  is  calculated  by  summing  the  products  of 
individual  ion  abundances  p and  their  respective  logarithms,  "p" 
thus  represents  the  ion  abundance  in  terms  of  the  percent  of  total  ioni- 
zation of  the  molecule  in  question  or  in  another  sense,  more  germane 
here,  the  probability  of  occurrence  of  that  ion  fragment  in  the  spectrum, 

Mass  spectra  are  thus  converted  to  a single  valued  number  and  in  this 
way  a data  file  can  be  constructed  consisting  of  these  numbers.  An 
example  is  shown  in  Table  1.  These  compounds  have  been  selected  to  show 
the  typical  variation  in  the  "entropy"  value  which  is  expected  for  the 
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variation  in  the  degree  of  unsaturation  in  the  molecule.  In  searching  a 
file  of  precalculated  Khinchine  values,  a matching  index  is  used  to 
establish  correspondence  of  the  value  for  an  unknown  with  the  library 
value.  In  the  early  work  with  Khinchine  values,  the  construction  of  the 
reference  file  was  limited  to  a few  hundred  compounds  which  are  normally 
encountered  in  the  analysis  of  the  volatile  components  of  natural  products, 
most  especially  of  foodstuffs. 

When  applied  to  large  libraries,  in  the  order  of  7000  compounds,  the 
range  of  values  of  the  Khinchine  function  was  not  sufficiently  unique. 

An  example  of  this  situation  is  shown  in  Table  2. 

To  provide  differentiation  between  two  compounds,  the  Khinchine 
functions  of  which  are  too  nearly  alike  to  be  diagnostic,  the  second 
function  on  our  list;  namely,  the  divergence  calculation  was  invoked. 

The  formula  for  this  calculation  appears  in  Figure  3.  Here  and  N2 
represent,  respectively  the  total  number  of  ions  in  each  of  the  2 compounds 
being  compared  and  "p"  has  the  same  meaning  as  in  the  Khinchine  function 
calculation.  In  practice  it  has  been  found  convenient  to  refer  the 
calculation  of  divergence  of  a given  compound  in  the  aliphatic  hydrocarbon 
series  for  instance,  to  the  normal  straight-  chained  alkane  of  the  same 

carbon  number.  In  Table  3 are  listed  the  divergence  values  for 
several  Cg  hydrocarbons  referred  to  N-hexane.  For  example,  in  the  case 
of  two  compounds  such  as  hexene  and  methyl pentene,  the  divergence  values 
are  markedly  different. 

Considering  the  fact  that  the  final  format  in  which  data  are  handled 
in  a digital  computer  is  in  the  binary  world  we  were  led  to  develop  an 
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approach  which  we  call  selected  binary  coding.  This  code  allows  compress 
sion  of  the  library  file  through  selective  binary  coding  of  characteristic 
peaks  and  use  of  variable  length  logical  records, 

The  coding  procedure  is  illustrated  in  Figure  4.  A hypothetical 
mass  spectrum  is  shown  with  a representation  of  a 16  bit  computer  word 
at  the  top.  Selective  binary  coding  is  accomplished  by  dividing  the  mass 
range  of  interest  into  multiple  groups  of  seven.  The  number  corresponding 
to  the  peak  in  the  spectrum  in  each  group  which  has  the  highest  intensity 
is  then  coded  as  a three  bit  binary  number.  In  this  example  the  fourth 
position  is  encoded  in  the  first  grouping,  the  seventh  in  the  second 
and  so  on;  zero  is  used  to  denote  the  absence  of  a peak  within  the  group- 
ing, thereby  giving  a total  of  eight  possible  values,  hence  the  term 
octal  coding  by  which  we  have  designated  this  scheme, 

Representation  of  an  octal  number  within  the  computer  requires 
three  bits;  thus,  in  a 16-bit  machine  such  as  the  HP  2116B  used  in  setting 
up  this  system,  five  octal  characters  can  be  stored  in  each  computer  word 
with  one  bit  left  over.  A single  computer  word  is  capable  of  storing 
information  which  spans  a range  of  35  amu.  Compounds  with  a greater 
mass  range  require  additional  computer  words.  As  many  as  needed  are  used; 
the  last  word  is  designated  by  setting  a flag  in  the  16th  bit.  A further 
illustration  of  this  system  of  encoding  is  shown  in  Table  4, 

Since  15  bit  positions  allow  2^  or  32768  unique  representations,  it 
is  possible  to  encode  that  many  compounds  in  a unique  manner. 

The  octal  code  for  a mass  spectrum  may  be  easily  obtained  from 
digitized  mass  and  intensity  data  acquired  on-line  and  stored  for  subsequent 
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processing.  A flow  chart  of  the  procedure  is  shown  in  Figure  4.  A 
normal  sequence  of  data  processing  involves  the  following  on-line 
operations. 

1.  Conversion  of  analog  output  to  digital  i.e,,  mass  and  intensity 

data. 

2.  Condensation  of  the  data  to  octal  format  utilizing  the  routine 
in  this  slide,  thus  providing  the  binary  equivalent  of  the  "spectrum" 
for  which  the  library  is  being  searched. 

3.  When  binary  "1"  is  sensed  in  the  16th  bit,  the  number  of  words 
to  encode  the  "spectrum"  is  known,  then  it  is  not  necessary  to  search 
the  entire  library  but  only  the  subset  or  sub-library  collection  which 
requires  that  number  of  words  for  coding. 

The  subdivision  into  subfiles  of  variable  record  length  and  creation 
of  a name  file  were  designed  to  make  maximum  use  of  random  access  mass 
storage  devices. 

For  each  unknown  compound  being  searched,  a matching  index  is  cal- 
culated; the  five  best  matches  are  printed  out,  thereby  anticipating 
the  possibility  for  identical  or  nearly  identical  matching  indices. 

The  usefulness  of  selective  binary  coding  for  search  and  retrieval  from 
large  data  files  has  been  well  illustrated  in  the  literature  and  for  this 
purpose  it  actually  matters  little  what  size  window  one  uses.  If, 
however,  one  is  to  make  use  of  the  diagnostic  information  which  is 
contained  in  a mass  spectrum,  the  size  of  the  window  and  the  starting 
masses  for  each  window  become  vitally  important.  Consider  Table  5- 
The  vertical  column  at  the  left  refers  to  the  number  of  the  window  under 
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consideration  and  the  horizontal  sequence  across  the  top  of  the  table 
consists  of  the  octal  digits  below  each  of  which  appear  the  masses 
corresponding  to  the  octal  code.  The  first  window, for  example,  will 
contain  the  masses  coded  octally  from  23  to  29.  A series  of  masses 
corresponding  to  a series  of  ions  such  as  29,  43,  57  and  71  etc, 
represents  alkyl  fragments  and  will  each  be  encoded  as  the  7th  digit 
in  an  odd-numbered  window. 

A pattern  of  codes  is  seen  in  Table  6 which  demonstrates  the  recur- 
ring octal  digit  pattern  for  various  series  of  ions  which  correspond  to 
several  common  functional  groups.  The  hydrocarbon  fragments  will  always 
be  coded  in  odd-numbered  windows,  and  when  the  skeletal  structure  is  a 
saturated  aliphatic  hydrocarbon,  one  would  expect  the  most  abundant  peaks 
to  be  a series  of  alkyl  fragments  which  would  be  coded  as  octal  7. 

If  the  compound  is  unsaturated,  the  octal  code  would  be  a 5.  Moreover, 
ions  which  are  still  more  unsaturated,  such  as  the  alkynyl  series  having 
the  formula  CnH2nr-3  and  consisting  of  the  series  39,  53,  67  etc,, 
will  be  represented  by  octal  digit  3 in  odd-numbered  windows.  It  should 
be  noted  that  oxygen  containing  peaks  such  as  those  arising  from  ketones 
and  aldehydes  are  isometric  with  the  alkyl  ion  fragments  and  will  be 
coded  in  octal  7 in  odd-numbered  windows.  The  code  pattern,  which 
occurs  in  the  even-numbered  windows  and  produces  the  series  of  ions 
which  is  characteristic  for  alcohols  etc,,  is  seen  in  the  bottom  left 
row  of  the  table. 

Additional  octal  code  patterns  which  are  characteristic  of  functional 
group  type  compounds  are  seen  in  the  right  hand  column.  Thus,  the 
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appearance  of  a particular  digit  as  a series  in  an  even-numbered  window 
may  be  correlated  with  a particular  structural  feature.  Generally 
speaking,  octal  patterns  from  the  odd-numbered  windows  are  less  specific, 
whereas  the  patterns  for  ions  encoded  in  even-numbered  windows  are  more 
closely  related  to  functionality.  In  order  to  achieve  the  accurate 
identification  for  an  unknown  spectrum  as  created  on-line  from  a GS/MS  run, 
it  is  necessary  to  use  a combination  of  odd  and  even  numbered  windows. 

Some  examples  taken  from  real  data  coded  from  compounds  selected 
at  random  from  a file  of  mass  spectral  data  are  seen  in  Table  7.  The 
top  left  shows  some  patterns  for  alcohols.  Compounds  A-D correspond 
to  ethanol,  isopropanol,  n-propanol  and  2-ethyl-l-butanol . The  first 
column  of  digits  corresponds  to  the  first  odd-numbered  window  containg 
masses  23-29.  The  successive  odd-numbered  windows  contain,  for  all  the 
compounds,  digits  representing  hydrocarbon  series  of  fragment  ions.  The 
corresponding  even-numbered  windows  all  contain  the  octal  digit  2 which  is 
characteristic  of  rearrangement  ions  for  alcohols  and  other  oxygenated 
species.  In  the  case  of  the  isopropanol  and  ethylbutanol  the  octal  digit  5, 
rather  than  7,  appears,  suggesting  the  possibility  of  branching.  The  6 
which  appears  in  the  3rd  window  for  compound  C is  an  error  in  coding; 
normally  a 7 would  be  expected  to  appear. 

At  the  top  right  patterns  for  various  esters  are  shown.  As  with 
alcohols  the  octally  encoded  digit  in  the  odd-numbered  windows  is  a 7 
which  corresponds  to  the  series  of  alkyl  fragment  ions.  In  the  even- 
numbered  windows  the  2s  appear,  representing  oxygenated  species.  In  this 
case  the  pattern  at  the  end  of  the  code  is  found  to  be  diagnostic  for  esters. 
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The  pattern  for  aldehydes  is  similar.  Again,  sevens  occur  in  odd- 
numbered  windows  representing  alkyl  fragments.  In  this  case  there  is  a 
contribution  to  these  ions  from  the  isometric  ion  containing  CO.  Compounds 
C and  D are  actually  2 different  spectra  for  3-methyl butanal . These 
again  suggest  the  possibility  for  detecting  branching  by  the  appearance 
of  5 in  one  of  the  odd-numbered  windows.  The  succession  of  Is  is  found 
to  be  characteristic  for  both  aldehydes  and  ketones. 

Most  important  in  the  overall  is  the  success  of  this  technique  of 
coding  when  used  with  real  data  as  obtained  on-line  and  in  real-time 
from  a standard  GC/MS  system.  Moreover,  experience  with  the  technique 
adds  further  support  to  the  choice  of  coding  one  mass  in  7 as  opposed 
to  one  mass  in  every  14  amu.  This  choice  provides  greater  success  in 
identification  of  functional  group  character  when  the  compound  for  which 
one  is  searching  is  not  in  the  library. 

The  most  demanding  aspect  of  the  operations  within  the  Analytical 
Chemistry  Group  is  associated  with  monitoring  the  volatile  components 
in  irradiated-preserved  foods.  A study  has  already  been  completed  to 
indicate  that  there  are  no  toxic  substances  produced  during  this  preserva- 
tion process;  the  current  study  of  volatile  components  is  being  conducted 
in  conjunction  with  an  animal  feeding  study  to  determine  the  alteration 
in  nature,  if  any,  of  the  nutritional  quality  of  the  food  stuff.  As 
such,  large  numbers  of  samples  are  produced  from  the  rapid  scanning  mass 
spectrometric  monitoring  of  the  eluent  from  a gas  chromatograph,  by 
means  of  which  the  volatile  components  are  separated  subsequent  to  vacuum 
distillation  into  rough  fractions  based  on  b.p.  of  component. 
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There  are  two  aspects  of  computer  applications  to  this  work.  The 
first  is  the  automatic  acquisition,  reduction  and  interpretation  of 
mass  spectrometric  data  based  on  the  principles  of  octal  codification  which 
have  been  presented  earlier.  The  repetitive  nature  of  this  application 
allows  high  accuracy  in  identification  insomuch  as  one  is  looking  for 
differences  among  samples  which  relate  to  nutritional  value. 

The  second  aspect  of  this  deals  with  statistical  analysis  of  the 
quantitative  data  which  are  produced  from  the  studies,  i.e.,  a study  of 
the  variation  in  quantitative  values  for  the  various  components  determined 
as  a function  of  sample  treatment,  method  used  for  irradiation  and  storage 
time.  It  is  customary  to  plot  time  of  storage  after  irradiation  vs. 
concentration  of  component(s)  for  the  various  procurements  or  lots  of 
meat,  thereby  indicating  scatter  of  the  data.  Included  in  this  plot 
are  usually  the  x and  2 sigma  lines  for  each  collection  of  data.  The 
previously  described  investigations  have  been  conducted  on  output  from 
low  resolution  mass  spectrometers.  At  the  time  of  this  writing  the 
addition  of  high  resolution  capability  is  nearing  completion;  it  is  based 
on  electrical  processing  of  the  signal  from  a CEC  21-110  high  resolution 
mass  spectrometer  operating  via  a high  speed  digitizing  unit  which  was 
custom  built  for  these  laboratories  by  Adage.  The  Adage  device  inter- 
faces through  traditional  links,  directly  with  the  computer  where  the  data 
are  processed  with  standard  routines  to  provide  exact  mass  values. 

Although  in-house  requirements  have  not  yet  dictated  the  interface 
of  our  standalone  gas  chromatographs  to  the  system,  the  multi-programming 
aspects  of  the  PDP  15/76  system  will  readily  allow  the  addition  of  many 
slow  devices  such  as  these,  liquid-liquid  chromatographs  and  standard 
spectroscopic  devices,  viz  I.R.,  U.V.  and  visible  spectrophotometry. 
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TABLE  i 


Compound  Khlnchine  Function 

n-butane  0,926614 

2- methyl propene  0.812125 

t-2-butene  1.041750 

3- methyl -1 ,2-butadiene  1,215960 

1.3.5- hexatriene  1,339620 

1 .5- hexadiyne  1,605300 

3-heptyne  1,379110 


TABLE  2 

Use  of  Divergence  to  Resolve  Non-unique  Khlnchine  Functions 

Compound  Entropy  Divergence 

2,4-hexadiene  0.473  10,97 

3-methyl -1 ,3-pentadiene  0,474  26,19 

2-ethyl -1, 3-butadiene  0.497  7,79 

1 ,3-hexadiene  0,498  9.17 


354 


TABLE  3 


Reference  Compound  for  the  Senes  is  Hexane 


Comparison  Compound 
n-hex-l-ene 

2 - methyl pent-2-ene 
cyclohexane 

3- hexyne 
2-methyl  pen tene 

1- hexyne 

2- hexyne 


Exaropl 
m/e  to  be 

Mass  ranges  encoded 


23-29 

24 

30-36 

32 

37-43 

43 

44-50 

0 

Divergence(J) 

4.2269 

7.7805 

10.0238 

11,8302 

12,5054 

18.7163 

25,8092 


TABLE  4 

; of  Octal  Coding 
Position 

of  m/e  in  Binary  Octal 

octet  Code  Code 

2 010  2 

3 Oil  3 

7 111  7 

0 000  0 
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TABLE  5 


Array  of  Masses  According  to  Position  in  Spectrum  Grouping* 


Position  in  Group** 

Group 

Number 

1 

2 

3 

4 

5 

6 

7 

1 

23 

24 

25 

26 

27 

28 

Q 

2 

30 

© 

32 

33 

34 

35 

36 

3 

37 

38 

39 

40 

41 

42 

43 

4 

44 

© 

46 

47 

48 

49 

50 

5 

51 

52 

53 

54 

55 

56 

0 

6 

58 

© 

60 

61 

62 

63 

64 

7 

65 

66 

67 

68 

69 

70 

© 

8 

72 

0 

74 

75 

76 

77 

78 

9 

79 

80 

81 

82 

83 

84 

0 

10 

86 

0 

88 

89 

90 

91 

92 

11 

93 

94 

95 

96 

97 

98 

99 

♦Circled  masses  indicate  characteristic  Ion  series, 
**See  Figure  6. 
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Table  6 Octal  Code  Patterns  Generated  by  Characteristic 

Ion  Series 


ALKYL  IONS  AMLd.  10.  ,o 

e * 


0* 

m/e 

15  29 

43 

57 

71  etc. 

V 

m/e 

00 

44 

r r. 
.9  C 

etc 

Cl 

d 

code 

7 7 

7 

7 

7 

c 

n 

code 

1 

1 

1 

ALIdlNL  5 

CYC'LOALKYL 

IONS 

StiLRJK  IONS 

e 

0 

tii/e 

27  41 

55  09 

£3  etc. 

V 

m/e  33  47  61 

75  etc 

vi 

c 

d 

code 

5 5 

5 5 

5 

n 

code  444 

4 

ALCOHOL  r 

CTFLP  I T.'S 

e 

V 

i:i/e 

31  45 

59  73 

etc. 

1 1 

code 

2 2 

2 2 

•'Indicates  odd  or  even  numbered  groups  (see  Table  2) . 
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TABLE  7 


PATTERN  FOR  ALCOHOLS 


PATTERN  FOR  ESTERS 


Compel , Compd , 


A 

7272 

A 

7202 

03 

B 

5272 

02 

B 

7271 

0203 

C 

7262 

72 

C 

7272 

04620 

3 

D 

7272 

52 

D 

7272 

04620 

3 

PATTERN  FOR  ALDEHYDES 


Compd , 


A 7131 

B 7271 

C 7251 

D 7051 


7 

7071 

71717  1 

71707  1 


358 


FOOD  SCIENCES  LABORATORY 

Moss  Spectral  Data  Acquisition  System 


Fig.  2 


-n  = 2 p-,  log  P-, 
i=1 


Fig.  3 


n Pij  n p2i 

J(1,2)  = N1  2 (P|j  - Pj)  loge  — + N2  p(p2i  - Pj>  >°9e  — 


Pli  + P2i 


ou  jjq 


I 


CD 


10 


no 


<D 

E 
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Recursive  Digital  Filtering  Applied  to  a Mini-Computer 
Data  Acquisition  System  Proposed  for  Army  Wind  Tunnels 


R.  P.  Reklis 

U.S.  Army  Ballistic  Research  Laboratories 
Aberdeen  Proving  Ground,  Maryland 


A recursive  digital  filtering  scheme  designed  for  use  with  wind 
tunnel  data  will  be  discussed.  The  filter  has  been  designed  in  such 
a fashion  that  it  will  not  distort  low  order  polynomial  data.  This 
filter  is  to  be  used  in  a proposed  mini-computer  based  data  acquisition 
system.  This  system  will  automate  several  functions  required  in  wind 
tunnel  data  taking  and  will  gather  and  display  data  graphically.  Its 
design  will  be  discussed  briefly. 


363 


INTRODUCTION 


Wind  tunnel  data  are  composed  of  a series  of  data  sets.  Each  set  is  made 
up  of  several  runs.  A run  is  composed  of  data  taken  while  a spinning  model 
slows  down  or  while  a model  is  swept  through  angle  of  attack.  Data  are  often 
linear  with  spin  rate  or  angle  of  attack  and  are  generally  fit  with  a low 
order  polynomial.  At  the  end  of  a run  calibration  is  checked  and  another  run 
is  made  at  an  altered  angle  of  attack  or  roll  position.  A data  set  is  ended 
when  it  is  necessary  to  stop  the  flow  of  air  in  the  tunnel  in  order  to  make 
further  alterations  in  the  test. 

A data  system  has  been  proposed  that  will  automate  all  of  the  action 
which  takes  place  during  a set  of  runs.  When  the  set  is  complete  data  will 
be  plotted  for  use  by  the  test  engineer.  Data  will  be  processed  in  the 
following  manner.  A tape  will  be  made  of  digitized  data  in  nearly  the  form 
they  are  produced  by  the  analog  to  digital  converter.  This  tape  will  be  made 
on  line  for  future  analysis  on  the  main  BRL  computer  and  for  possible  playback 
through  the  data  acquisition  system.  Data  will  be  simultaneously  filtered 
digitally  and  stored  in  core  buffers.  At  the  end  of  each  run  data  will  be 
taken  from  these  core  buffers  and  written  on  disc.  At  the  end  of  a series  of 
runs  data  will  be  plotted.  The  system  will  allow  data  from  any  run  in  the 
entire  series  to  be  plotted.  It  will  be  possible  to  overlay  graphs  and  to 
alter  the  scale  of  plots  from  the  keyboard.  The  hardware  for  this  mini- 
computer system  is  diagrammed  in  Figure  1.  It  consists  of  a Harris  6024/5  CPU 
which  is  currently  owned  together  with  various  peripherals.  The  Astrodata 
seen  in  Figure  1 refers  to  the  data  acquisition  system  currently  in  use. 

It  is  a hard  wired  system  that  writes  data  on  magnetic  tape.  The  fifty 
instrumentation  amplifiers,  multiplexer  wiring,  and  A/D  converter  contained 
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In  this  system  will  be  carried  over.  The  tunnel  control  subsystem  sets 

switches  which  control  tunnel  operation.  All  other  peripherals  are  standard. 

The  Astrodata  amplifiers  described  above  include  switch  selectable 

Butterworth  filters.  It  is  desirable  to  increase  the  variety  of  filtering 

available  and  to  allow  the  selection  of  filtering  after  test  data  are 

collected.  This  last  feature  in  particular  suggests  digital  filtering. 

The  scheme  selected  must  use  a minimum  amount  of  both  memory  and  time. 

A recursive  scheme  seems  best  suited.  It  is  in  addition  necessary  that  the 

filter  not  distort  data  that  may  be  fit  with  a low  order  polynomial. 

This  paper  begins  with  a review  of  some  background  material.  Digital 

filtering  is  developed  on  this  background  that  fits  the  needs  described 

above.  Random  noise  effects,  start  up  effects,  numerical  accuracy,  and 

deviation  from  the  limiting  continuous  behavior  are  then  discussed.  Further 

background  material  is  available  from  several  references.  Reference  1 

contains  reprints  of  important  papers  and  provides  a good  review  of  the 

2 

subject.  The  paper  by  C.  M.  Rader  and  B.  Gold  is  of  particular  interest. 

3 

A related  discussion  of  polynomial  smoothing  is  given  by  Blackman  . 
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SOME  USEFUL  CONCEPTS 


Filtering  Properties  of  Derivatives 

It  is,  of  course,  obvious  that  the  derivative  of  a jittery  function  is 
likely  to  be  even  more  jittery.  It  should  not  be  surprising,  therefore, 
that  the  analysis  of  many  filters  leads  to  differential  equations  in  which 
the  derivatives  are  taken  of  the  filter  output,  which  is  to  be  smoothed  to 
obtain  the  filter  input,  which  is  rough.  For  example,  the  differential 
equation  that  describes  the  common  resistor-capacitor  filter  is, 

CdWdt  + WR  =VR- 

Such  an  equation  describes  a method  by  which  the  filter  input  may  be  obtained 
from  the  output.  Clearly  it  is  desirable  to  invert  this  equation  to  a form 
that  gives  the  filter  output  from  the  filter  input.  Such  an  inversion  leads 
to  the  Green's  function  and  to  the  transfer  function  of  the  problem. 

.Transfer  Function  and  Green’s  Function 
The  differential  equations  that  describe  many  filters  have  the  form 


n=0  3,1  dtn  V°ut  = Vin  * 


0) 


where  the  an's  are  constants  and  the  zeroth  derivative  is  taken  to  be  one. 
The  operator  in  Eq.  (1)  applied  to  VQu^.  is  linear  and  for  this  reason 
solution  pairs  V^,  Vin  have  the  property  that  + V^,  V.^  + V.^ 

is  a solution  pair  if  VQUt  , V^,  and  VQUt  , V-n  are.  Thus,  Eq.  (1)  can 
be  rewritten  by  expressing  VQUt  and  Vin  in  terms  of  some  complete  set 
of  functions  and  solving  for  each  component.  This  is  usually  done  by 
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expressing  VQUt  and  V-n  as  Fourier  transforms  since  this  procedure 
diagonalizes  the  problem,  that  is. 


an  (iw)n] 


e1ut 


in  which  ^ an  (iu)n  is  the  eigenvalue  for  the  eigen  function  eiwt. 
The  expression. 


17  £n  an  (iu)n  * 
n=0  n 


is  known  as  the  transfer  function  since  if  a signal  eiwt  is  input  to  the 
filter  the  output  will  be. 


•ut/  i »„  o.)n , 

n=0  n 


and,  thus,  Eq.  (1)  has  been  inverted  in  frequency  space. 

A return  from  frequency  to  time  space  can  be  made  by  carrying  through 
the  inverse  Fourier  transforms.  Defining  the  Fourier  transform  pair  as: 

« 

F (t)  = 1//27  J f (w)  e1wt  /. 

— 05 
00 

f (»)  = 1/^7  J F (t'  ) e'Ut'  dt'  (■ 

-.CD 
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gives, 


Vout  (t)  - d/2.)  / df  V,  (f  ) {/  do.[e1“(t-t’  > / t a„  (1»)"]} 

v ' —CO  —>CQ  n-u 


This  expression  is  of  the  form 


\ut  (t-  ) 9 (t  - f ) , 


Where  the  Green's  function  g (t  - t'  ) is  given  by 


g (t  - t'  ) = (1/2 


00 

it)  J . dw^e1'1 


,(t-t'  ) 


N 

/ z a 
n=0 


„ <w] 


(4) 


The  integral  in  Eq.  (4)  can  be  readily  evaluated  by  use  of  the  residue 
thereon  if  the  zeros  of  the  polynomial. 


N 


PN  ’ n?0  *n  *"  • 


(5) 


are  known. 

For  a physically  realizable  filter  the  Green's  function  must  be  zero 
for  t < t'  as  the  filter  cannot  exhibit  a response  to  events  in  the  future. 
It  is  v/orthwhile  noting  that  this  will  only  be  the  case  if  the  polynomial  in 
Eq.  (5)  has  no  zeros  in  the  right  half  plane. 

The  Aliasing  Problem 

A digital  filtering  scheme  acts  on  a finite  number  of  discretely  spaced 
data  points  to  which  the  continuous  analysis  developed  above  does  not 
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directly  apply  and  it  is  necessary  to  carry  over  the  various  equations  to  the 
discrete  case.  Equation  (1)  for  example  can  be  rewritten  in  terms  of  finite 
differences  or,  in  general,  as, 

N 

Z am  H (n  - m)  = D (n),  (6) 

m=Q 

where  H is  the  output  of  the  filter,  D is  the  data  set  and  the  a 's  are 
constants.  Such  an  equation  is  linear  and  Fourier  analysis  applies.  The 
transform  pair  becomes, 

M 

D (n)  « (1//2M+T)  E d (k)  exp  [2Trikn/(2M+l )]  , (7) 

k=-M 

M 

d (k)  = (l/^ii+T)  E D (n' ) exp  [-2nikn'/(2M+l)]  , (8) 

n'  =-M 

for  2M+1  data  points. 

This  form  should  be  very  familiar  to  anyone  with  solid  state  experience; 
a crystal  lattice  is  a discrete  space  and  similar  forms  are  used.  Just  as 
the  finite  nature  of  the  lattice  folds  momentum  space  in  upon  itself  giving 
rise  to  the  Brillouin  zone,  frequency  space  is  folded  in  the  digital  filter 
problem.  For  example,  there  is  no  way  to  distinguish  sixty  Hz  noise  from  a 
D.C.  signal  if  data  are  sampled  at  sixty  Hz  (see  Figure  2)  as  sixty  Hz  is 
folded  into  DC  at  this  sample  rate.  This  phenomenon  is  known  as  aliasing 
and  implies  that  a data  rate  must  be  selected  with  care. 

The  relationship  between  the  continuous  and  discrete  Fourier  transforms 
is  obtained  by  sampling  a continuous  function  whose  transform  is  known  and 
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calculating  the  discrete  transform  Eq.  (9)  , by  combining  Eq.  (2)  with 
Eq.  (8). 


d(k)  = (1//2H+T) 


Z f f(w)  e111)nT  J exp  [2ni kn/(2M+l ) 

n=-M  UL  -»  J 


. 0) 


where  t is  the  sampling  period.  This  equation  simplifies  to: 


d(k)  = (l//2w(2M+T))  / dwj  f (u)  sin  C(2M+1)(wt  - 27rk/(2M+l))  /2]  / 

-03  ' 

(10) 

sin  [(ux  - 2irk/(2M+l)/2]^  . 


The  implications  of  Eq.  (10)  are  that  the  discrete  coefficient  samples 
the  continuous  coefficients  in  a window  of  half  width  2ir/(2M+l)  about 
w = 2irk/x(2M+l ) . A word  of  caution  is  urged  in  this  interpretation,  however. 
A noisy  signal  with  components  at  frequencies  much  higher  than  the  data 
sampling  rate  will  produce  an  essentially  random  data  component.  Clearly,  if 
the  D (n)  in  Eq.  (8)  are  random  the  Fourier  components  d (k)  will  also 
be  random.  The  problem  of  obtaining  useful  numerical  frequency  spectra  is 
discussed  in  reference  4.  As  long  as  the  Green's  function  is  broad  enough  to 
include  many  data  points  and  as  long  as  the  band  pass  is  much  wider  than 
2it/(2M+l ) statistical  effects  will  average  out.  This  is  demonstrated  below. 

Linearity 

These  filters  are  designed  for  use  with  data  that  approximates  a low 
order  polynomial  in  addition  noise  at  particular  frequencies  may  be  present 
as  well  as  a random  variation  in  the  data  points.  Finally,  the  solution  to 
the  equations  that  describe  these  filters  is  an  initial  value  problem  and 
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the  response  depends  on  starting  conditions.  Fortunately,  these  equations 
are  linear  and  the  response  to  each  of  these  signal  components  may  be 
treated  separately. 

SHIFT  FILTERS 

The  result  of  applying  the  operator  0, 

0 =[  E (an/n!)  dn/dtnl,  (11) 

Ln=0  J 

to  a polynomial  in  t of  order  N is  to  shift  the  polynomial,  without 
distortion,  by  an  amount  a along  the  t axis;  that  is. 


r N n N 

E t | = E cm  (t  + a)m 
Lm=0  m j riN)  m 


where  a and  the  c are  constants. 

hi 

The  transfer  function  for  this  operator  is, 

l/£  E^  (iaw)n/n!)J,  (12) 

which  for  small  aw  is  just  and  has  magnitude  one.  The  band  pass  for 

a filter  described  by  Eq.  (11)  will,  therefore,  be  flat  for  small  w.  For 
large  to  only  the  last  term  in  the  polynomial  in  Eq.  12  need  be  kept  and  the 
filter  will  cut  out  high  frequency  as  N!/(ow)  . A filter  described  by  this 
operator  will  have  the  desirable  property  of  a flat  band  pass  and  will  shift 
without  distortion  a data  signal  that  approximates  a low  order  polynomial. 
Unfortunately,  only  filters  N * 1,  4 are  realizable  as  the  polynomial, 

N n 

PN(X)  = I Xn/n!  , (13) 

" n=0 


has  zeros  in  the  right  half  plane  for  N > 4. 
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The  band  pass  curves  of  the  first  four  filters  in  this  series  are  shown 
in  Figure  3.  These  have  been  adjusted,  by  proper  choice  of  a,  so  that  they 

have  the  same  3 db.  cut  off  frequency.  The  relationship  between  a and  the 

3 db.  cut  off  frequency  is  given  in  Table  I. 

The  Green's  functions  of  the  first  four  filters  can  be  obtained  from 

the  residue  theorem  and  Eq.  4 as  stated  above  and  can  be  shown  to  be, 

N M 

- - (N!/o)  e exp  n (t  - t )/<*]  /z"  , t > tf  , 

04) 

l 0 , t < 1/  , 

where  the  Zn's  are  the  zeros  of  the  polynomial,  Eq.  13.  These  zeros  are 
plotted  in  Figure  4. 

Note  should  be  made  that  the  filters  for  N = 1 and  2 are  Butterworth 
filters;  however,  the  filters  for  N = 3 and  4 are  not. 

NUMERICAL  SIMULATION 

The  desirable  feature  of  this  series  of  filters  is  that  low  order 
polynomial  data  is  not  distorted.  This  is  the  feature  that  led  to  the 
development  of  the  operator  in  Eq.  11  and  it  can  be  used  to  simulate  the 
operator  numerically.  Rearranging  Eq.  6 to  solve  for  the  last  output  of 
the  filter  H(n)  in  terms  of  the  previous  outputs  and  the  current  datum 
point  D(n)  gives, 


N 

H(n)  = Aq  D(n)  + E Am  H(n-m)  . (15) 
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This  is  the  useful  form  of  the  filter  algorithm  and  gives  the  current 
filter  output  in  terms  of  past  filter  outputs  and  current  data.  The  A's 
can  be  solved  for  by  insisting  that. 


D(n)  3 H(n+a)  , 

06) 

when 

N 

H(n)  - e a.  n*  . 

- ft  V 

07) 

i=0  * 

Equations  16  and  17  imply  that  the  filter  algorithm  given  by  Eq.  15  will 
have  the  appropriate  shifting  property  for  polynomial  data.  Applying 
Eq.  16  and  Eq.  17  to  Eq.  15  leads  to  the  set  of  equations, 

. N 

1 * A0  (1+a)4  + t Am  (1-mr  , (18) 

m=l 

a - 0,  1,  ....  N, 
which  can  be  solved  f.or  the  A^'s  giving, 

A0  « N!/[(l+a)(2+a)...(N+a)]  , (19) 

Am  a - [a/(m+a)]  (-l)m  N!/[m!  (N-m)  !]  . (20) 

The  connection  between  the  discrete  operator,  Eq.  15,  and  the  continuous 
operator,  Eq.  11,  can  be  seen  by  writing  Eq.  15  with  the  coefficients  given 
in  Eq.  19  and  Eq.  20  in  terms  of  difference  operators  as, 

N 

D(n)  = H(n-N)  + z C (N+a) (N-l+a)  — (N- 5,+1+a)/*. ! ] y * H(n-Nn). 

£=1 
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where 


Jt 

/ H(n)  = Z (f)  (-l)k  H(n-k) . 
k=0  K 

For  large  a this  becomes, 

D(n)  = I [(ax) ] V*  H(n-Nn)//  . 

1=0 

Let  H sample  the  function  F(t)  at  increments  of  t and  fix  ax  3 a a constant 
then  by  definition  the  right  hand  side  of  the  above  equation  becomes, 

N . „ 

Z {a/ll)  d*  F(t)/dtx  , 

as  x approaches  zero. 

Thus,  the  discrete  filter  given  in  Eq.  15  with  coefficients  given  by 
Eq.  19  and  Eq.  20  is  equivalent  to  the  continuous  operator  given  in  Eq.  11 
in  the  limit  of  small  sampling  period  with  a = ax. 

Such  equivalence  is  often  shown  by  fitting  the  discrete  data  points 
with  a continuous  function  of  some  specific  form  that  eases  the  translation 
from  continuous  to  discrete  equations  maintaining  the  band  pass  characteristics 

5 

of  the  filter  . This  was  not  done  in  translating  the  shift  filter,  since  the 
property  that  remains  constant  in  this  translation  is  the  shift  property 
and  not  the  band  pass. 

RANDOM  NOISE 

The  average  response,  H(n) , of  the  filter  given  in  Eq.  15  to  random 
data  of  distribution. 
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e_(D/o)2  h /ir  , 

is  zero.  The  average  squared  response  gives  a measure  of  the  randomness  of 
the  filtered  data  and  is  given  by, 

H(n)2  = D(n)2  [ E g2  (n-&)1  , 

U=0  J 

where  g (n-*.)  is  the  discrete  Green's  function,  i.e.  the  response  H(n)  to 
D(n-Jt)  s 1 and  D otherwise  zero.  The  random  noise  attenuation  factor, 

s g2  (n-A)  , 

can  be  calculated  in  the  large  a limit  from  the  continuous  Green's  functions 
and  becomes, 

- [<N!)2/a]  j,  X,  [,/zk  & (zk + zkk>]  • <2,> 

The  value  of  these  factors  are  given  in  Table  II  while  Figure  5 shows  a plot 
of  the  attenuation  factors  for  small  a. 

START  UP 

As  mentioned  previously  the  filter  output  depends  on  starting  conditions. 
As  may  be  seen  in  Eq.  14  the  magnitude  of  the  Green's  functions  damps  out 
exponentially  and  consequently  starting  conditions  may  be  ignored  after  a 
certain  startup  time  that  goes  as  a divided  by  the  real  part  of  the  zero  of 
the  polynomial  in  Eq.  13  that  is  closest  to  the  imaginary  axis.  These 
damping  factors  are  given  in  Table  III.  Clearly,  some  improvement  may  be 
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made  if  some  scheme  can  be  used  to  start  the  filter  algorithm  by  assigning 

0 

H(1-m),  M - 1,  2...N  some  appropriate  values.  This  may  be  done  by  setting 
all  H(l-m)  equal  to  D(l)  or  by  curve  fitting  the  first  few  data  points  and 
extrapolating  backwards,  etc. 


NUMERICAL  ACCURACY 

The  response  of  the  digital  shift  filters  to  a signal  D(n)  = 1 may  be 
calculated  from  Eq.  15  as. 


H(n)  = A°  V,  fl"’  (22) 

m=l 


which,  by  Eq.  18,  is  H(n)  « 1.  Clearly,  the  sum  in  Eq.  22  represents  the 
D.C.  gain  of  the  filter.  It  can  be  shown  that  the  values. 


Gi  k=  Ao  a*"k  + z Am  *!/k!  U"k)!  * k < * * 

* m=l 

k 

give  a measure  of  the  distortion  in  the  form  of  an  n term  present  in  the 
filter  output  when  nA  is  input.  Note  that  the  G 's  reduce  to  the  gain 
calculated  from  Eq.  22.  The  calculation  of  these  distortion  terms  gives  a 
measure  of  the  numerical  accuracy  to  be  expected  from  the  algorithm. 

SMALL  SHIFT  BEHAVIOR 

Due  to  the  aliasing  problem  the  band  passes  of  digital  filters  are 
periodic  in  u>  with  period  2tt/t.  It  is,  therefore,  necessary  to  use  them 
in  conjunction  with  some  electronic  filtering  to  insure  that  signal 
components  of  frequency  m > tt/t  are  minimal.  The  data  acquisition  system 
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used  with  the  wind  tunnel  application  uses  a second  order  Butterworth  filter 
with  a cut  off  frequency  set  at  w » tt/t.  A choice  of  a > 1 will  then 
insure  that  final  filter  output  is  predominantly  controlled  by  the  digital 
filter.  The  question  then  arises  as  to  whether  the  large  a approximation 
is  appropriate  for  a > 1.  The  random  noise  attenuation  factor  being  the 
integral  of  the  square  of  the  Green's  function  serves  as  a good  single 
parameter  for  a study  of  the  closeness  to  which  the  behavior  of  the  filter 
is  approximated  by  the  large  a formulas.  As  can  be  seen  from  Figure  5 
differences  can  be  expected  and  the  small  a band  passes  and  Green's  functions 
must  be  calculated. 

The  band  pass  may  be  obtained  from  the  numerical  transfer  function, 

f1  ' ^ Am  Gxp  ("iwTm)]  /A0  » 

(d  = 2 tt  i k/t  (2M+1),  k = 

The  attenuation  curves  are  plotted  in  Figure  6 for  a « 1 and  in  Figure 
7 for  a = 5.  Figure  8 shows  the  3 db.  roll  off  frequency  as  a function  of 
u>;  it  is  multiplied  by  a to  show  the  approach  to  the  large  a limit. 

The  small  a Green's  functions  are  easily  obtained  by  applying  a pulse 
to  the  filter,  i.e.  D(l)  = 1,  D(n)  =0,  n *=  1.  The  variable  of  interest 
when  applying  the  filters  is  the  width  of  this  Green's  function  since  this 
determines  the  time  necessary  for  errors  in  the  initial  conditions  to  damp 
out.  These  widths  are  plotted  in  Figure  9.  The  criterion  used  in  calculating 
them  is  that  integral  of  the  square  of  the  Green's  function  has  reached 
ninty  percent  of  its  limiting  value.  For  practical  purposes  the  width  of 
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the  Green's  function  is  about  twice  the  value  calculated  by  this  criterion, 
the  factor  of  two  arising  from  the  fact  that  the  Green's  function  was 
squared. 

The  3 db.  roll  off  frequency  and  random  noise  factors  for  small  a are 
given  in  Tables  IV  and  V. 


CONCLUSION 

The  filter  algorithms  discussed  in  this  paper  have  been  linear  in 
nature.  That  is  the  action  of  the  filters  on  various  data  components 
may  be  analyzed  separately.  As  has  been  shown  the  main  data  signal  will 
be  preserved  without  distortion  if  it  is  in  the  form  of  a small  order 
polynomial.  The  attenuation  of  noise  at  particular  frequencies  has  been 
discussed.  In  general,  the  band  pass  becomes  wider  with  increasing  filter 
order  and  a particularly  noticeable  spike  or  resonance  is  observed  in  the 
fourth  order  filter  which  may  pose  a problem  if  noise  is  present  at  this 
frequency.  The  fourth  order  filter  will  also  pass  more  random  noise  than 
the  others,  and  it  is  also  true  that  the  fourth  order  filter  will  require 
a greater  amount  of  time  to  damp  out  starting  effects  or  noise  spikes. 

All  four  filter  types  are  useful  if  consideration  is  taken  of  the 
peculiarities  found  in  order  four,  however.  Figure  10  shows  the  effects  of 
all  four  orders  on  a sample  of  wind  tunnel  data  containing  a large  60  Hz 
noise  component.  Note  that  this  component  appears  at  20  Hz  due  to  the 
40  Hz  sampling  rate  and  is  eliminated  by  the  filtering. 
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Table  I 


Attenuation  Factors  and  3 db.  Cut  Off  Frequencies  for  Shift  Filters 
Order  Attenuation  Factor  3 db.  Frequency  (Hz) 


1 

1//1  + (tua)^ 

f3  = .276/a 

2 

1//1  + (<oa)4/4 

.296/a 

3 

1//1  - (ujct)4/12  + (ioa)6/36 

. 390/a 

4 

1//1  - (aia)6/72  + (u>a)8/576 

.499/a 

Table  II 

Random  Noise  Attenuation  Factors  for  Large  a Filters 


Filter  Order 
1 
2 

3 

4 


Attenuation  Factor 
.5  /a 
.5  /a 
.75/a 
1.5  /a 


Table  III 

Green's  Function  Damping  Factor 

N.  Factor 

1 - 1 

2 - 1 

3 - 1.42 

4 - 3.69 
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Table  IV 

Band  Edge  Frequencies  (Hz)  for  Small  a Filters  (3  db) 


Data  Rate  = 40  Hz  for (120  Hz  multiply  by  3) 


a 

1 

Filter  Order 
2 3 

4 

— 

i 

8.40 

6.68 

7.07 

7.64 

2 

4.62 

4.14 

4.71 

5.35 

3 

3.23 

3.03 

3.59 

4.18 

4 

2.48 

2.40 

2.90 

3.42 

5 

2.02 

2.00 

2.43 

2.92 

6 

1.71 

1.71 

2.10 

2.54 

8 

1.31 

1.32 

1.66 

2.01 

10 

1.05 

1.08 

1.36 

1.67 

12 

0.89 

0.91 

1.16 

1.43 

14 

0.76 

0.79 

1.01 

1.25 

16 

0.67 

0.70 

0.89 

1.11 

18 

0.60 

0.63 

0.80 

1.00 

20 

0.54 

0.56 

0.73 

0.91 
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Table  V 


Random  Noise 

Attenuation  for  Small 

a Filters 

Filter  Order 

l 

2 

3 

4 

1 

.33 

.29 

.33 

.47 

2 

.20 

.18 

.22 

.36 

3 

.14 

.13 

.17 

.29 

4 

.11 

.10 

.14 

.24 

5 

.091 

.087 

.12 

.21 

6 

.077 

.072 

.10 

.19 

8 

.059 

.056 

.079 

.15 

10 

.048 

.046 

.066 

.12 
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Figure  1.  Block  Diagram  of  Data  Acquisition  and  Tunnel  Automation  System 
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Figure  3*  Attenuation  Curves  Adjusted  to  Have  the  Same  Band  Pass 


Figure  4.  Zeros  of  z Xm/m! 

m=l 
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Attenuation  Factor 


o i/r 


Frequency 


Figure  6.  Attenuation  Curves  for  a = 1 
387 


Figure  7.  Attenuation  Curves  for  a =5 
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Extent  of  the  Green’s  Function /a 


Filter  Order 


Fi aure  9 Width  of  the  Small  a Green's  Functions  Divided  by  a to  Show  the  Approach  to  the  Large  a 
Figure  9.  J!^t^weTJ™‘^5e^nused  t0  determine  width  is  that  the  integral  at  the  square  of 

& Kin's  fuSctlM  ^obtained  ninety  percent  of  its  limiting  value. 
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II.  Leo  Butler 

U.  5.  Army  Waterways  Lxperiment  Station 
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AMH  Department 
University  of  Alabama 
University,  Alabama  3S486 

ABSTRACT.  Two  numerical  models  for  simulating  the  tidal  hydrodynamics 
of  an  inlet,  bay  or  harbor  are  compared.  An  implicit  finite  difference 
solution  to  the  basic  equations  of  hydrodynamic  flow  has  been  developed 
at  the  Waterways  Experiment  Station.  This  implicit  model  is  compared  with 
an  explicit  formulation  as  applied  by  Masch,  1973,  Both  models  calculate 
depth  averaged  velocity  components  and  tidal  elevations  as  a function  of 
position  and  time  during  a specified  tidal  cycle.  In  addition  to  the 
actual  bathymetry,  the  two  models  include  variable  bottom  roughness,  non- 
linear advective  terms  in  the  momentum  equations,  treatment  of  regions 
which  are  inundated  during  a portion  of  the  tidal  cycle,  exposed  and 
submei'ged  barriers,  wind  stress,  and  other  physical  features  of  the  region 
to  be  modeled.  A discussion  of  the  mathematical  formulation  and  associated 
finite  difference  approximations  is  included.  The  comparison  consists  in 
applying  both  models  to  Masonboro  Inlet,  North  Carolina,  with  identical 
bathymetric  data,  boundary  conditions,  and  spatial  step  size.  The  hydro- 
dynamic  solutions  obtained  are  compared  as  well  as  the  economics  associated 
with  the  two  models.  While,  in  general,  the  solutions  obtained  from  the 
two  schemes  are  comparable,  the  explicit  solution  has  a considerably  more 
stringent  stability  criterion  limiting  the  time  step.  Remedial  actions 
required  to  overcome  stability  problems  inherent  in  implicit  schemes  are 
discussed. 


1 . INTRODUCTION.  Most  numerical  simulations  of  tidal  hydro- 
dynamics associated  with  inlets  and  bays  have  been  performed  using 
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finite  difference  schemes  based  upon  an  explicit  formulation.  A 
code  employing  such  a scheme  was  developed  by  Reid  5 Bodine  (1968) 
and  used  extensively  in  treating  problems  which  include  possible  flooding 
of  low-lying  areas  and  subgrid  topographic  features.  An  extension  of 
this  work  was  carried  out  by  Masch,  Brandes,  and  Reagan  (1973)  for 
the  U,  S.  Army  Coastal  engineering  Research  Center  to  help  evaluate 
the  degree  to  which  mathematical  models  can  be  used  to  predict  the  tidal 
hydrodynamics  (exclusive  of  sediment  transport)  of  an  inlet  system. 

The  Wave  Dynamics  Division,  WES  Hydraulics  Laboratory,  has  applied 
implicit  finite  difference  schemes  to  a variety  of  problems  such  as 
tsunami  propagation,  simplified  tidal  models,  landslide  generated  water 
waves  and  storm  surge  calculations  (Butler  and  Durham,  197S) . Implicit 
schemes  have  been  applied  successfully  by  Leendertse  (1970,  1971)  to 
regions  which  include  areas  that  are  inundated  only  during  a portion 
of  the  tidal  cycle.  The  present  work  is  an  extension  of  the  ideas 
expressed  by  Leendertse  but  differs  in  that  the  basic  equations  are 
written  in  terms  of  vertically  integrated  flows  per  unit  of  width 
rather  than  velocity,  and  subgrid  features,  such  as  exposed, submerged, 
and  overtopping  barriers,  are  treated. 

The  principal  reason  for  using  the  implicit  formulation  is 
economic.  Explicit  schemes  are  generally  hampered  by  a stringent 
restriction  on  the  time  step  used  in  the  computational  procedure.  For 
large  regions  simulation  may  be  infeasible.  Normally,  implicit  schemes 
do  not  have  such  restrictions  and,  therefore,  can  be  applied  using 
a significantly  larger  time  step. 

The  method  of  comparison  consists  in  applying  both  implicit 
and  explicit  codes  to  a seven  and  one  half  square  mile  area  at 
Masonboro  Inlet,  North  Carolina.  Comparisons  of  surface  elevations 
and  depth  averaged  flows  with  prototype  data  are  made  for  each  scheme. 
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2.  Theory- -Pquat  ions  of  fluid  Flow,  The  hydrodynamic  equations 
used  in  this  work  are  derived  from  the  standard  three-dimensional 
Navier-Stokcs  equations.  By  assuming  the  vertical  accelerations  are 
small  and  the  fluid  is  well  mixed,  and  integrating  the  flow  from 
the  sea  bottom  to  the  water  surface,  the  usual  two-dimensional  depth- 
averaged  form  of  the  equations  of  momentum  and  continuity  are  obtained: 


MOMENTUM 

BU  JJ  BU  V3U 
Bt  + d Bx  * d By 


- f v ♦ Ed  T- 

BX 


BV 

Bt 


U BV  V BV  . n 
+ - — tT-  + f U + gd 
d BX  d By 


Bt} 

By 


CONTINUITY 
a,  BU  BV 

— 1 + — + — « f 

Bt  T Bx  By  s 


U,  V:  FLOW/UNIT  WIDTH 
SURFACE  ELEVATION 


Cl) 


C2) 


(3) 


In  these  equations,  U and  V are  the  vertically  integrated  flows  per  unit 
of  width  at  time  t in  the  x and  y directions,  respectively;  n is  the  water 
surface  elevation  with  respect  to  the  given  datum;  d is  the  water 
depth  at  (x,y,t);  other  terms  are  defined  in  the  table  of  notations. 

As  evident  in  the  equations  of  motion,  advection,  coriolis  force,  bottom 
friction,  rainfall,  and  wind  forces  are  included.  Two  forms  of  finite 
difference  solutions  to  equations  (1),  (2),  and  (3)  are  considered:  an 

explicit  and  implicit  formulation.  The  computational  grid  used  in  both 
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formulations  is  identical.  A recti incar  mesh  is  placed  over  the  study 
area,  and  within  each  grid  cell  the  following  assumptions  are  made: 

(1)  the  value  of  n is  considered  to  be  an  average  over  a grid  cell 
centered  at  x = MAx  and  y - N Ay  and  at  time  kAt;  (2)  the  value  of 
UN  M is  given  at  the  center  of  the  lower  cell  face;  and  (3)  the  value 
°f  VN,  ^ is  given  at  the  center  of  the  right-hand  cell  face  (Figure  1). 


M 


0 - FLOW/UNIT  WIDTH  IN 
X- DIRECTION  (U) 

A - FLOW/UNIT  WIDTH  IN 
Y * DIRECTION  (V) 

O - SURFACE  ELEVATION  h)t 
WATER  DEPTH  (d), 

FRICTIONAL  COEFFICIENT  (C  or  n) 

Figure  1.  Cell  definition 

In  addition,  the  water  depth  d and  Chezy  frictional  coefficient  C are 
also  defined  at  the  center  of  grid  cells. 

Explicit  Solution  Scheme.  The  explicit  solution  method  used  by 
Reid  is  a time-centered  difference  scheme  involving  a procedure  of  the 
"leap  frog"  type  for  computation  of  flow  and  water  levels.  The  follow- 
ing notation  will  be  used:  k - kAt;  angle  brackets,^  ^ or  ^ ^ , to 

indicate  that  terms  maintained  in  differential  form  arc  evaluated  with 
centered  difference  expressions  over  one  or  two  grid  cells,  respectively. 
Quantities  which  are  not  specified  at  a given  spatial  location  are 
replaced  by  averaged  values  indicated  by  a bar,  such  as  Using 

these  notations,  applying  centered  differences  in  time  and  space,  and 
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k + \ k+V>  k+1 

solving  for  U ",V  ,n  in  terms  of  quantities  at  previous  time 
levels,  explicit  expressions  can  be  written  as  follows: 


MOMENTUM 


wk-l/2  /-vii \k~l/2 

Uk+1/z  = Uk-,/2  - At  — —r; — \ “ ) + AtfVk-,/Z 

o \ <>y  / 2 


Atgd*<^)  + AtFj  Ch  AT  (N,  M + 1/2) 


vk+l/2  = - At  " + 

d*  \ ax  /2 


WHERE 


CONTINUITY 


- Atgd*  / + A tFy  / C,y  AT  (N  + 1/2,  M) 


r At/ayV^72  Bn„At  ^ 

,x  d*\3x/2  2.21(d*)4/3  * 

W.  =r  (Uk_l/z)Z  + (Vk',/2)*  L* 


C«*  “ 1 d*  \3y. 


Lk_1/2  gnZAt 


2 2.21(d*) 


"y 


wv  = (0k_l/2)2  + (vk'1/2)2  U* 


d*  = V - h 


AT  (N , M) 


The  calculations  for  each  time  step  are  divided  into  two  halves; 
the  flows  are  computed  during  the  first  half  of  the  time  step,  and  the 
results  are  used  in  the  continuity  equation  to  calculate  the  surface 
elevations  during  the  second  half. 
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Implicit  Solution  Scheme . To  sol vc  the  governing  equat i ons 
implicitly,  the  same  space-staggered  scheme  is  used.  The  implicit 
code  employs  an  alternating-direction  technique  whose  calculations 
are  divided  into  two  parts.  The  first  step,  or  Vcycle,  consists  in 
solving  for  n and  U implicitly;  the  second  ^-cycle  computes  n and  V 
implicitly.  The  omitted  transport  in  each  %-cycle  is  assumed  constant 
for  that  step.  Employing  a centered  difference  operator  to  the 
momentum  equation  (1)  and  the  continuity  equation  (3)  along  a grid 
line  parallel  to  the  x~axis,  results  in  a system  of  linear  algebraic 
equations  whose  coefficient  matrix  is  tridiagonal.  The  form  of  the 
equations  for  the  first  h-cy cle  is  given  by: 


MOMENTUM 


U k + 1/2  = jjk-1/2 


+ At 


tvk  - 


IJk  + 1/2 
d* 


ljk+1/2  + yk-1/2 

+ 2(Ck)2  (d*)3 

AT  (N,  M +■  1/2) 


((Uk~1/2)z  + (V*)* 


CONTINUITY 

Wy+l/2 

AT  (N,  M) 

WHERE  d*  = *jk  - h 


(12) 


(13) 


(14) 


Notations  used  in  equations  (7)  and  (8)  are  the  same  as  those  given 
in  the  description  of  the  explicit  formulation.  The  equations  for  the 
second  Vcycle  are  similar  to  the  above  and  are  not  presented, 

houndary  Conditions.  Various  types  of  boundary  conditions  are 
permissible  in  the  present  system  of  computation  for  both  explicit  and 
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implicit  codes.  In  both  codes,  boundary  conditions  at  the  tide  computa- 
tion boundary  (open  boundary  conditions)  are  accomplished  by  setting  the 
water  levels,  r)N  as  prescribed  by  input  tables.  Flow  rates  may  be 
specified  instead  of  water  levels.  All  additional  boundary  conditions 
relate  the  normal  component  of  flow  at  the  boundary  to  the  state  of  the 
water  level  at  the  boundary. 

Water-Land  Boundaries.  Such  boundaries  are  prescribed  along 
cell  faces,  hence  this  condition  is  handled  by  specifying  U = 0 or 
V = 0 for  those  cells  where  impermeable  boundaries  exist.  In  estuarine 
systems  with  large  areas  of  low-lying  terrain  and  a significant  tidal 
range,  many  areas  alternately  dry  and  flood  with  each  tidal  cycle. 

This  behavior  is  simulated  by  making  the  location  of  the  land-water 
boundary  a function  of  the  current  value  of  the  total  water  depth. 

By  checking  the  water  level  in  adjacent  cells  relative  to  the  ground 
elevation,  a determination  is  made  as  to  the  possibility  of  inundation. 

If  flooding  is  possible,  the  boundary  face  is  treated  as  open  and  compu- 
tations for  n>  U,  and  V are  made  for  that  cell.  The  drying  of  cells 
is  simply  the  inverse  process. 

Subgrid  Barriers.  Subgrid  barriers  are  defined  along  cell  faces 
and  are  of  three  types:  exposed,  submerged,  and  overtopping.  One 

characteristic  of  such  barriers  is  that  the  surface  elevation  is 
computed  at  the  center  of  cells  on  either  side  of  the  barrier.  The 
treatment  of  these  barriers  in  the  explicit  code  can  be  found  in 
Masch  (1973).  The  following  discussion  is  limited  to  the  way  in  which 
these  conditions  are  simulated  in  the  implicit  code.  Exposed  barriers 
are  handled  by  simply  specifying  a no-flow  condition  across  the  cell 
face.  This  type  of  barrier  is  used  to  describe  dykes,  jetties,  and 
similar  features  which  are  impermeable  and  usually  of  width  much  less  than 
the  spatial  grid  step.  Submerged  barriers  are  used  to  simulate  flows 
across  such  barriers  as  submerged  reefs,  spoil  banks,  pipe  lines,  etc. 

The  water  level  on  each  side  of  a submerged  barrier  must  always  exceed 
the  barrier  crest  elevation.  The  flow  over  a submerged  barrier  can  be 
controlled  in  a manner  similar  to  that  used  by  Masch,  but  experience 
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has  shown  that  by  defining  a special  Chezy  coefficient  for  the  barrier 
face,  the  flow  over  the  barrier  can  be  simulated  without  introducing 
unwanted  transients  due  to  the  use  of  the  submerged  weir  equation. 

Overtopping  barrier  is  a terminology  used  to  distinguish  a 
barrier  which  can  be  submerged  during  one  portion  of  the  tidal  cycle 
and  totally  exposed  in  another.  Masch  used  a broad-crested  weir 
formula  to  describe  the  overtopping  nature  of  the  flow  and  then  the 
submerged  weir  formula  when  appropriate.  Since  a larger  time  step 
is  used  in  the  implicit  code  and  the  duration  time  of  overtopping 
is  short,  a Chezy  formulation  is  again  used  to  simulate  the  flow 
across  the  barrier.  When  the  barrier  is  exposed  a very  small  Chezy 
coefficient  (high  friction)  is  used  to  "stop”  the  flow.  When  overtopping 
occurs,  the  coefficient  is  increased  to  a specified  maximum  as  a function 
of  the  water-level  over  the  barrier.  As  the  water  level  decreases  at  a 
later  time,  the  coefficient  is  decreased  accordingly. 

Numerical  Stability.  For  an  explicit  solution  scheme,  the  grid 
size  and  computational  time  step  are  related  through  a stability 
criterion.  The  criterion  associated  with  the  explicit  scheme  presented 
here  is  given  by  the  relation 


At  < 


As 


/2gd 


MAX 


(15) 


where  As  is  the  mesh  size  and  dWAV  is  the  maximum  water  depth  used 

MAX 

in  the  model.  This  approximate  condition  was  derived  from  expressions 
obtained  by  linearizing  the  problem.  When  the  non-linear  terms  are  in- 
cluded, it  can  be  expected  that  the  time  step  will  require  further  reduction. 

Considering  the  linearized  implicit  equations,  it  can  be  shown  that 
the  difference  scheme  is  unconditionally  stable.  In  other  words  the 

space  and  time  steps  may  be  chosen  to  meet  required  accuracy  in 

representing  topographic  features  and  external  forcing  functions.  The 

3U  3U 

inclusion  of  the  non-linear  advectivc  terms  (of  the  form  U — , V 7^) 

3x  o y 
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into  the  implicit  scheme  results  in  inherent  instabilities,  All  derivatives 

in  the  basic  equations  are  approximated  with  centered  differences  over 

a single  grid  space  with  the  exception  of  the  advective  terms,  which 

arc  computed  over  two  grid  cells.  Oscillations  of  the  water  level  at 

a grid  point  (period  4At.)  may  occur  and  grow  unbounded* 

A scheme  which  proved  capable  of  eliminating  these  instabilities 

was  the  use  of  a recursive  digital  filter  of  the  form 

nk+l  - u k k-1  _ 

n - an(c)  + bn  + cn  (16) 

r c i 

where  n represents  the  computed  water  level  and  n>  the  water  level 
value  used  in  further  computations;  coefficients  a,b,c  are  chosen  to 
filter  out  oscillations  of  period  4At  (corresponds  to  h the  Nyquist 
frequency)  and  smaller,  while  permitting  the  longer  period  wave  motion 
to  remain  undisturbed.  The  coefficients  of  the  digital  filter  must 
also  be  chosen  in  such  a way  as  to  maintain  stability  of  the  filter. 

By  applying  the  linearized  system  with  and  without  a filter  it  was 
demonstrated  that  filtering  does  not  affect  the  results.  For  applica- 
tions presented  in  this  paper,  values  of  a = 0.6,  b = 0.3,  and  c = 0.1 
were  selected. 

An  additional  instability,  which  is  termed  a "secondary  flow" 
phenomena,  may  also  occur.  A discussion  of  this  problem  was  presented 
by  Vreugdenhil  (1973).  The  scheme  normally  employed  to  eliminate 
this  instability  is  the  inclusion  of  terms  in  the  momentum  equations  of 
the  form 

e 14  + 14}.  (17) 

Such  a form  is  referred  to  as  an  eddy-viscosity  term  and  is  generally 
taken  as  a representation  of  the  effective-stress  in  vertical  planes. 

These  terms  arc  usually  neglected  but  are  very  important  when  the 
flow  has  a strong  tendency  to  converge  or  diverge  at  various  locations 
within  the  system* 
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3 . Application  to  Mnsonboro  Inlet,  North  Carolina-Computational 
Grid.  A map  of  the  Mason boro  Inlet  system  is  shown  in  Fig.  2*  A grid 
size  of  300  feet  was  adapted  resulting  in  a mesh  of  dimensions  41  X 57. 
The  computational  water  points  are  1721  in  number*  Tidal  elevations 
obtained  from  prototype  data  taken  in  a survey  on  32  September  1969 
arc  imposed  at  boundary  lines  denoted  by  circled  numbers.  The  bathemetry 
at  Masonboro  Inlet  for  this  period  of  time  was  also  surveyed;  available 
boat  sheets  were  digitized  for  the  computational  area.  The  frictional 
coefficients  are  defined  by  assigning  number  codes  to  the  various 
types  of  terrain  and  applying  known  values  of  Manning’s  n and  the 
relationship: 


C - Hr  <1/6  (18) 

Since  the  prototype  data  was  taken  on  a calm  day,  no  wind  stress  was 
applied  in  the  numerical  model* 

The  jetty  system  protruding  from  the  outer  barrier  island  is 
composed  of  a weir  section,  elevation  2'  above  datum  and  1000'  in 
length,  extending  from  the  outer  island  to  near  the  bend  in  the  jetty. 
The  remainder  of  the  structure  is  impermeable.  Exposed  barriers  were 
used  to  represent  very  narrow  strips  of  high  land  in  the  marsh  area 
behind  the  barrier  islands.  The  outer  model  boundaries  are  set  at 
a distance  of  10,000  feet  from  the  inlet  throat  to  minimize  effects 
from  inherent  problems  in  handling  all  the  terms  in  the  equations 
at  the  input  boundaries. 

Comparison  of  Results.  Although  excellent  results  were  obtained 
by  both  codes  at  gages  located  throughout  the  system,  for  brevity, 
only  results  at  the  three  locations  depicted  in  Fig,  2 will  be  presented 
for  comparison.  Figure  3 shows  the  degree  to  which  the  numerical  codes 
simulate  the  prototype  tides.  The  computations  were  begun  at  1330  EST 
and  a fold-over  occurred  at  2000  F:ST,  equating  data  at  this  hour  with 
that  taken  at  0730  EST.  This  procedure  may  cause  some  discrepancy  in 
the  results  but  is  required  since  the  models  must  spin  up  from  rest 
at  low  tide,  both  models  describe  the  tides  equally  well.  The 
discrepancy  at  gage  2 in  the  ebb  phase  may  have  resulted  from  two 
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Figure  2.  Act i a 1 extent  of  computational  grid  for 
Masonboro  Inlet,  North  Carolina 


403 


WES  IMPLICIT  | _ * REC-BODWE 


problems:  the  prototype  tide  gage  was  located  within  a marina  behind 

the  northern  barrier  island  and  there  was  some  question  as  to  the  phasing 
of  the  tidal  input  specified  at  boundary  2. 

Figure  4 shows  the  tidal  velocities  calculated  in  the  numerical 
models  relative  to  prototype  velocity  measurements  taken  at  three  depths: 
surface,  mid-depth  and  bottom*  Recalling  that  the  models  produce  a depth 
averaged  flow  rate,  velocities  arc  obtained  by  dividing  the  flow  by 
the  local  water  depth.  Again,  good  agreement  is  obtained  in  both  models. 

Figures  5~6  display  sample  circulation  patterns  at  flood  stage 
(5  Hrs  = 1830  EST)  and  ebb  stage  (10  Hrs  = 1100  EST) . The  arrowheads 
indicate  direction  of  flow  and  their  length  is  proportional  to  the 
magnitude  of  the  flow  rate. 

4.  Comparison  Statistics  and  Conclusions.  Table  1 below  relates 
the  computer  run  time  required  for  the  Masonboro  simulation  for  both 
numerical  models. 


TABLE  1.  COMPARISON  STATISTICS 


GRID 

NUMBER  OF  WATER  POINTS  1721 

SPATIAL  STEPSIZE  (FT)  300 

SOLUTION 

RUN* 

At  TIME 

(SEC)  (MIN) 

EXPLICIT  3 60 

IMPLICIT  90  4 


* BASED  ON  18  PROTOTYPE  HOUR 
SIMULATION  ON  A CDC  7600. 

The  stability  criterion  associated  with  the  explicit  scheme  predicts  a 
5.2  second  time  step  is  necessary  for  the  range  of  water  depths  appearing 
in  the  system.  However,  time  steps  of  5 sec  and  4 sec  were  tried  and 
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Figure  4.  Comparison  of  velocity  agreement  with  prototype 
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Figure  5.  Circulation  patterns  at  flood  stage 
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instabilities  resulted.  A stepsize  of  3 sec  was  found  to  be  stable 
f o r t he  on t L re  s j mu  1 at  ion.  The  i nip  1 i e j t scheme  was  run  for  time  st ep s 
of  A 5,  IK),  and  180  seconds.  Results  for  the  two  smaller  time  steps 
were  practically  identical.  Results  for  At  = 180  compared  favorably 
with  the  prototype  but  some  discrepancies  in  phasing  were  noted. 

Results  for  At  = 90  seconds  have  been  presented*  Note  that  At  for 
the  implicit  scheme  is  the  time  for  a complete  cycle,  that  is,  U and  V 
are  computed  once  and  n,  twice.  In  the  explicit  scheme,  U,  V and  n 
arc  computed  once  in  a time  step  of  3 seconds.  The  speed  of  the  implicit 
scheme  can  be  expressed  by  the  relationship 

I = ,S*i 

At  (19) 

c 

where  At.  and  At  are  the  time  steps  of  the  implicit  and  explicit  schemes, 
x e 

respectively,  and  I is  the  execution  speed  of  the  implicit  model  relative 
to  the  explicit  model.  In  this  application  a 15:1  ratio  in  execution  time 
was  noted. 

The  results  of  this  study  demonstrate  that  the  implicit  scheme 
presented  herein  can  reliably  simulate  the  tidal  hydrodynamics  of 
a complicated  inlet  system.  The  WHS  implicit  model  is  considerably 
more  economical  to  apply  and  should  prove  to  be  most  beneficial  in 
applications  to  estuarine  systems  as  well  as  other  problems  which 
require  simulation  of  long  period  wave  motion. 
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NOTATIONS 


N 


c 

a 

dMAX 

f 


w 

g 

h 

k 

M,N 


nx,ny 


u,v 


x,y 

e 

n 

5 

Ax,  Ay 

As 

At 


Chezy  frictional  coefficient 
Total  water  depth 

Maximum  water  depth  at  any  location  and  time  in  the  system 

Coriolis  parameter 

External  forcing  functions 

Acceleration  of  gravity 

Land  surface  elevation 

Time  increment  counter 

Indicies  denoting  spatial  increments  in  the  x and  y direction 
Manning  Frictional  coefficient 
Averaged  Manning* s coefficient 
Time 

Integrated  horizontal  velocity  components 
Cartesian  coordinates 
Eddy-viscosity  coefficient 

Water  surface  elevation  with  respect  to  given  datum 

Rainfall  minus  evaporation 

Spatail  increment 

General  spatial  increment 

Time  increment 
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ABSTRACT.  A numerical  model  is  developed  for  simulating 
the  development  and  propagation  of  landslide  generated  water 
waves  in  reservoirs.  The  numerical  model  is  based  upon  a 
finite  difference  representation  of  the  depth  averaged  hydro- 
dynamic  equations.  The  landslide  is  formulated  as  a moving 
boundary  condition,  propagating  into  the  reservoir  and 
accelerating  the  fluid  due  to  physical  displacement  and 
viscous  drag.  Arbitrary  reservoir  geometry  and  landslide 
parameters  can  be  considered.  The  numerical  model  results 
are  compared  with  experimental  results  obtained  on  a Is  120 
undistorted  scale  physical  model  of  Libby  Dam  and  Lake 
Koocanusa  in  Montana.  Landslides  were  considered  reflecting 
a wide  range  of  landslide  volumes  and  velocities.  The  wave 
heights  predicted  by  the  numerical  model  are  in  good  agree- 
ment with  the  wave  heights  observed  in  the  physical  model. 

1.  INTRODUCTION.  There  are  many  serious  problems 

associated  with  rockfalls  or  landslides  into  bays,  lakes, 
reservoirs,  fjords  and  rivers.  These  problems  are  becoming 
increasingly  important  due  to  expanded  use  of  these  bodies 
for  recreational  purposes  and  the  increased  industrial  and 
residential  development  along  the  shores.  Some  areas  of 
concern  created  by  potential  slide  areas  are:  loss  of  life, 

damage  to  shoreside  structures  and  boats,  overtopping  of 
dams  by  surge  or  waves  with  resulting  damage  to  the  dam 

and  spillways,  failure  of  dams  with  resulting  large  scale 
flooding,  upstream  flooding  due  to  river  blockage  and  loss 
of  usage  of  the  water  body  due  to  restrictions  imposed  by 
the  final  position  of  the  slide  material.  Examples  of  the 
occurence  of  each  of  these  can  be  found  in  the  literature 
and  vividly  indicate  the  extent  of  the  potential  problem. 

2.  BACKGROUND  REVIEW.  Some  attention  has  previously 
been  directed  toward  obtaining  a qualitative  and  quantitative 
understanding  of  the  probability  of  occurrence  and  the 
characteristics  of  water  waves  generated  by  rockfalls  or 
landslides  into  reservoirs.  Most  laboratory  and  theoretical 
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investigations,  however,  have  been  two  dimensional  studies 
in  an  attempt  to  determine  the  fundamental  relationships 
between  the  geometry  and  speed  of  a rockfall  or  landslide 
and  the  characteristics  of  the  water  waves  generated  by 
this  mechanism  (5»  7,  8). 

In  1971  the  Hydraulics  Laboratory  at  the  U.  S.  Army 
Corps  of  Engineers  Waterways  Experiment  Station  (WES) 
constructed  a 1:120  undistorted  model  of  Libby  Dam  and  Lake 
Koocanusa  and  conducted  tests  to  determine  generated  wave 
heights  and  characteristics  resulting  from  the  sliding  of 
individual  rock  ribs  into  the  reservoir  (1).  Libby  Dam, 
a 420  ft.  (128  m)  high  concrete  gravity  structure  on  the 
Kootenai  River  in  western  Montana,  is  flanked  by  high  rock 
slopes  extending  several  thousand  feet  upstream  from  the 
structure.  The  rock  is  predominately  bedded  and  jointed. 
Several  prominent  rock  ribs  form  possible  rock  slide  zones. 

The  present  study  was  directed  toward  developing 
numerical  methods  for  predicting  the  effects  of  landslide 
generated  water  waves  in  reservoirs.  The  previous  physical 
model  tests  were  used  to  provide  comparative  data. 

3.  THE  NUMERICAL  MODEL.  A two  dimensional  approach 
which  possesses  a pseudo  three  dimensional  effect  was  utilized 
in  the  numerical  investigation.  The  vertical  component  of 
velocity  is  neglected  and  the  governing  hydrodynamic  equations 
are  integrated  over  the  water  depth.  An  average  two  dimen- 
sional flow  field  is  obtained  but  three  dimensional  geometry 
can  be  considered.  This  basic  approach  has  been  used  by 
several  authors  such  as  Hansen  (2),  Leendertse  (4)  and 
Platzman  (6). 


The  rectangular  coordinate  system  used  is  located  in 
the  plane  of  the  undisturbed  water  surface  as  shown  in 
Figure  1 . The  equations  of  motion  and  the  equation  of 
continuity  are  written  as  follows: 
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In  these  equations  u and  v are  velocity  components,  n is  the 
water  level  displacement  relative  to  the  initial  reservoir 
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Figure  2.  Grid  system  and  variable  definition  location 
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surface,  h is  the  undisturbed  reservoir  depth,  x,  is  the  vertical 
bottom  deformation  created  by  the  landslide  material,  R is 
the  bottom  friction  and  L is  a direct  acceleration  effect  of 
the  landslide  on  the  fluid.  Additional  terms,  Coriolis  effect, 
horizontal  diffusion,  and  wind  stress  could  be  considered  in 
the  equations  of  motion;  however,  these  terms  were  neglected 
in  this  investigation.  The  continuity  equation  has  been 
obtained  by  integrating  across  the  water  depth  and  applying 
kinematic  and  dynamic  boundary  conditions  at  the  surface  and 
bottom  of  the  reservoir.  The  bottom  friction  terms  are. 
represented  using  a modification  of  the  normal  formulation 
of  friction  in  terms  of  the  Chezy  coefficient.  This  formula- 
tion is  necessary  to  properly  account  for  the  bottom  friction 
where  the  moving  landslid  represents  the  bottom  boundary. 


R 


x 


g (Vx  - u)[(Vx  - u)2  + (Vy  - v)2]^ 
C2(h  + n ) 


(4) 


_ g(Vy  - v)[(Vx  - u)2  + (Vy  - v)2]1  (5 

y C2(h  + ti) 

The  velocity  components  V and  V are  the  components  of  the 
landslide  velocity.  y 

To  solve  the  governing  equations  a finite  difference 
approximation  of  the  equations  and  an  implicit-explicit 
alternating  direction  technique  is  employed.  A space  stag- 
gered scheme  is  used  in  which  velocities,  water  level  dis- 
placement, bottom  displacement,  and  water  depth  are  described 
at  different  locations  within  a grid  cell  as  shown  in  Figure 
2.  A double-time-step  operation  is  used  in  such  a manner  that 
the  terms  containing  space  derivatives  are  generally  taken  as 
alternating  forward  and  backward.  The  first  step  in  the 
calculation  consists  of  computing  u and  n implicitly  and  v 
explicity,  advancing  from  time  n At  to  (n  + t)  At..  The 
second  step  computes  n and  v implicitly  and  u explicitly, 
advancing  from  time  (n  + £)  At  to  (n+l)  At.  Central 
differences  are  used  for  evaluating  all  derivatives  in  the 
governing  equations.  This  method  of  solution  has  been  dis- 
cussed in  detail  by  Leendertse  (4). 

Three  types  of  boundaries  are  involved  in  the  calcula- 
tions. These  are  the  solid  boundaries  at  fixed  coastlines, 
the  fictitious  open  boundaries  arising  from  the  need  to 
truncate  the  region  of  computation  and  the  time  dependent 
boundary  between  the  landslide  surface  and  the  water  in  the 
reservoir. 
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A condition  of  complete  reflection  is  adopted  at  solid 
boundaries.  The  condition  can  be  written  as 

v . n = 0 (6) 

at  solid  boundaries,  where  the  n denotes  a unit  vector  normal 
to  the  boundary. 

The  applicable  condition  at  fictitious  open  boundaries 
is  more  difficult  to  specify.  The  total  transmission  of 
the  wave  is  the  physical  requirement  at  this  boundary;  how- 
ever, this  cannot  be  rigorously  achieved  without  computation 
beyond  the  boundary.  As  an  approximation  to  the  desired 
physical  requirement,  the  wave  profile  is  simply  assumed  to 
travel  without  change  of  form  across  the  last  interior  grid 
cell. 


The  landslide  is  represented  by  a time  dependent 
vertical  deformation  of  the  bottom  of  the  reservoir  plus 
additional  terms  to  represent  the  effect  of  the  landslide 
due  to  viscous  and  inertia  forces.  The  bottom  deformation 
propagates  into  specified  regions  of  the  reservoir  at  the 
average  speed  of  the  landslide  with  the  deformation  at  any 
particular  location  increasing  from  zero  to  a maximum  value 
according  to  a specified  time-displacement  relationship. 

For  those  portions  of  the  bottom  of  the  reservoir  through 
which  the  landslide  passes  but  which  do  not  experience  a 
net  change  in  ground  elevation,  the  deformation  is  allowed 
to  return  to  zero  at  a specified  rate.  The  handling  of  the 
landslide  condition  is  illustrated  in  Figure  3*  The  direc- 
tion, extent,  and  magnitude  of  the  bottom  deformation  is 
determined  by  knowledge  of  assumptions  concerning  the  path 
and  final  disposition  of  the  particular  landslide.  The 
water  in  the  reservoir  experiences  an  acceleration  due  to 
the  force  exerted  by  the  landslide  at  the  time  dependent 
boundary  between  the  water  and  landslide.  This  force  per 
unit  mass  consists  of  a component  due  to  the  vertical 
displacement  of  the  water  by  the  slide,  a component  due  to 
the  bottom  friction  between  the  landslide  and  the  water, 
and  a pressure  drag  exerted  on  the  water  by  the  front  of  the 
moving  landslide.  The  pressure  drag  at  the  leading  edge 
of  the  slide  can  be  represented  by 

{V  x = CD  («e  )(Vx  " u)^vx  " u)2  + (Vy  ' v)2^  Az  (7) 

where  is  a pressure  drag  coefficient  and  A0  is  an 

effective  vertical  cross-sectional  area  of  the  slide.  The 
force  per  unit  mass  can  then  be  considered  as; 
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<Vx  i (CD  Az  ><Vx  - u)E<Vx  • u>2  + (Vv  • v)2]5 

MASS  A h 

c 


where  Ac  is  the  grid  cell  area 

The  pressure  drag  contribution 
considered  as: 

Lx  = 0 (Vx  ' u)t(Vx  " u)2 

ly=9  (Vy  - v)[(Vx  - u)2 
cn  K 

where  (3  = ^ (j- ) 


and  h is  the  water  depth, 
per  unit  mass  can  then  be 

+ (vy 

- v)2p 

(8) 

- (vy 

- v)27 

(9) 

(10) 

A representation  of  the  landslide  is  allowed  to  propagate 
into  the  numerical  representation  of  the  reservoir,  acceler- 
ating the  fluid  due  to  physical  displacement,  viscous  effects, 
and  pressure  drag  effects.  The  resulting  waves  propagate 
across  the  reservoir  in  accordance  with  the  governing 
equations.  The  wave  height  and  velocity  components  are 
calculated  for  each  grid  cell  at  the  end  of  each  one  half 
time  step. 


4.  BRIEF  DISCUSSION  OF  PHYSICAL  MODEL  TESTS.  The  num- 
erical results  were  compared  with  experimental  data  from  a 
physical  model  of  Libby  Dam  and  Lake  Koocanusa  (l).  A site 
map  in  Figure  4 shows  the  topography  of  the  steep  rock 
slopes  upstream  of  Libby  Dam.  Potential  landslide  zones 
upstream  of  the  left  abutment  are  denoted  as  rock  ribs  909, 
914,  923*  92?  with  reference  to  the  stationing  along  Montana 
State  Highway  3 7*  The  area  covered  by  the  hydraulic  model 
study  is  shown  by  heavy  dotted  lines.  The  locations  of  two 
prehistoric  landslides  denoted  as  925  slide  and  93°  slide  are 
also  shown  in  Figure  4.  The  Libby  Dam  hydraulic  model  was 
constructed  to  a linear  scale  of  1:120,  model  to  prototype. 

An  undistorted  scale  was  used  to  insure  accurate  reproduction 
of  wave  heights,  wave  period  and  runup.  The  dimensions  of  the 
model  were  57  ft  (17.4  m)  long,  40  ft  (12.2  m)  wide  and  about 
5 ft  (1.5  m)  deep.  The  maximum  elevation  reproduced  in  the 
model  was  2700  ft  (823  m)  msl  with  an  adjustable  mechanical 
inclined  plane  to  support  the  landslide  material  above  this 
elevation.  A range  of  possible  landslide  velocities  was  con- 
sidered at  each  potential  slide  location.  These  velocities 
varied  between  37  fps  (11. 3 m/sec)  and  192  ft/sec  (58*5  m/ 
sec). 
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Wave  heights  at  selected  locations  in  the  model  were 
measured  using  direct  contact  electrical  wave-height  gages. 
The  gage  locations  are  shown  in  Figure  5-  The  runup  dis- 
tances on  the  sides  of  the  model  were  observed  and  the  water 
overtopping  the  dam  was  collected  and  measured. 

Figure  6 shows  the  physical  model  ready  for  a test 
run.  The  landslide  material  used  in  the  model  tests  was 
l/l8  cu.  ft.  (0.028  cu  m)  bags  filled  with  lead  and  iron 
ore.  Photographs  showing  the  final  position  of  the  slide 
were  taken  and  the  approximate  final  contours  of  the 
reservoir  were  determined  after  each  test.  A typical  set 
of  these  data  are  shown  in  Figures  ? and  8 for  Run  87. 

5-  THE  NUMERICAL  CALCULATIONS.  The  numerical  model 
required  that  a rectangular  grid  of  mesh  cells  be  established 
as  well  as  the  computational  boundaries  be  established,  and 
appropriate  information  be  defined  at  discrete  points  in  the 
reservoir.  The  grid  size  is  selected  to  obtain  the  desired 
spatial  resolution.  Initial  data  include  defining  the  reser- 
voir depth  and  a Chezy  coefficient  at  each  grid  line.  The 
time  step  is  chosen  using  as  a general  requirements 


At  < 


(11) 


This  relation  restricts  movement  of  the  water  wave  to  less 
than  one  grid  space  per  time  step. 

_ Defining  the  landslide  characteristics  is  the  most 
critical  aspect  of  the  model.  It  is  necessary  to  know  or 
assume  the  volume  of  the  slide  material,  the  average 
velocity  at  which  it  moves,  its  path  through  the  water, 
the  general  shape  of  its  leading  face,  a time-vertical 
displacement  relationship  for  the  slide  and  the  final  dis- 
position of  the  slide  in  the  reservoir.  In  a general 
investigation  the  use  of  this  model  would  require  a para- 
metric study.  Available  for  this  study  were  experimental 
data  from  a physical  model  study  so  that  these  parameters 
were  known  or  could  be  approximated  to  a reasonable  degree 
of  accuracy.  This  study  then  reflects  the  degree  to  which 
the  mathematical  model  can  represent  the  reservoir  conditions 
if  the  slide  characteristics  are  reasonably  well  defined. 

A time  step  of  1 sec  and  a spatial  grid  size  of  80  ft 
(24.2  m)  were  used  in  the  calculations.  The  pressure  drag 
parameter  p was  varied  between  0.005  and  0.0005  ft-l . 
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Figure  8.  Final  position  of  landslide  material  for  run  8j 
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Figure  11.  Wave  height  as  a function  Figure  14-  Wave  height  as  a function 

of  time  at  gage  3 f or  run  87  of  time  at  gage  6 for  run  87 


knowledge  concerning  prototype  conditions  or  how  to  model 
certain  phenomena.  The  results  indicate  that  the  numerical 
model  is  capable  of  modeling  landslide  generated  water  waves 
sufficiently  accurate  to  allow  overall  engineering  decisions 
to  be  made  concerning  the  possible  effects  of  a potential 
landslide. 

For  small  landslide  velocities  the  viscous  drag  and 
pressure  drag  contributions  to  wave  heights  and  velocities 
are  small  and  the  physical  displacement  of  the  water  by  the 
landslide  is  the  predominate  factor.  The  water  waves  pro- 
duced by  the  slide  will  generally  be  propagating  faster  than 
the  slide  is  moving  and  thus  the  initial  phase  of  the  land- 
slide must  be  accurately  defined  if  good  numerical  results 
are  to  be  obtained. 

For  large  landslide  velocities  significant  contribu- 
tions to  wave  heights  and  propagation  velocities  are  pro- 
duced by  the  viscous  drag  and  pressure  drag.  Viscous  drag 
and  pressure  drag  contributions  to  wave  heights  and  velocities 
are  focused  in  the  direction  of  the  landslide  to  a greater 
degree  than  the  contributions  from  the  physical  displacement 
of  water.  If  the . landslide  is  moving  faster  than  the  normal 
propagation  velocity  for  the  water  waves  it  produces,  the 
entire  path  and  time  history  of  the  landslide  becomes  of 
importance  in  obtaining  an  accurate  prediction  of  the  first 
wave  crest. 

Additional  experimental  work  is  needed  in  which  the 
landslide  parameters,  which  are  required  as  input  to  the 
numerical  program,  are  observed  and  measured  in  greater 
detail.  The  initial  physical  model  study  of  Libby  Dam  and 
Lake  Koocanusa  was  an  end  in  itself  and  not  designed  to  pro- 
vide information  to  verify  a numerical  model.  Fortunately, 
most  of  the  required  information  was  observed  but  not  in 
the  detail  that  would  be  desirable  for  use  in  detailed  veri- 
fication of  a numerical  model. 
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TABLE  1 

RUN  8?  - RIB  92? 

MAXIMUM  SLIDE  VELOCITY  = 84  fps  (25.6  m/sec) 


First  Wave 

Crest,  Ft  (m) 

Arrival  Time  of 

First  Crest,  sec 

Gage  Location 

Physical  Model 

Numerical  Model 

Physical  Model 

Numerical  Model 

1 

14  (4.27) 

16.9  (5.15) 

38 

37-5 

2 

12  (3-66) 

10.7  (3.26) 

36 

35-5 

3 

14  (4.27) 

11.6  (3-54) 

37 

35.5 

4 

15  (4.57) 

15.3  (4.66) 

40 

38.5 

5 

24  (7.32) 

18.0  (5.49) 

41 

41.0 

6 

17  (5-18) 

10.7  (3.26) 

39 

36.O 

7 

25  (7.62) 

25.1  (7.65) 

37 

42.5 

8 

12  (3.66) 

11-1  (3-38) 

32 

32.0 

9 

17  (5. 18 

11.6  (3*54) 

29 

30.0 

10 

7 (2.13) 

5.7  (1.74) 

23 

20.0 

ro 

11 

15  (4.57) 

19.2  (5-85) 

19 

23.5 

00 

12 

12  (3 .66) 

8.3  (2.53) 

19 

18.0 

13 

6 (1.83) 

10.1  (3.08) 

22 

21.5 

14 

5 (1-52) 

— 

37 

— 

15 

— 

— 

— 

— 

16 

— 

— 

— 

— 

Average  difference  in  wave 
heights  between  numerical 
and  experimental  values  23^ 

Average  difference  in  time 
of  arrival  between  numerical 
and  experimental  values  6.2^ 
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NOTATION 


Ac  = grid  cell  area 

= cross-sectional  area  of  landslide 

C = Chezy  coefficient 

CD  = pressure  drag  coefficient 

g = acceleration  of  gravity 

h = depth  of  the  undisturbed  water  surface 

Lx,  L = x and  y component  of  the  acceleration  effect  of 
A y the  landslide  on  the  water 

j » k = indices  for  finite  difference  grid  locations 
n = indices  indicating  multiples  of  the  time  step 

n - unit  vector  normal  to  boundary 

Rx»  Ry  = x and  y component  of  the  bottom  roughness  effect 
t = time 

u - depth-averaged  water  velocity  component  in  the 

x direction 

v = depth-averaged  water  velocity  component  in  the 

y direction 

v = velocity  vector  with  components  u and  v 

Vx  = x component  of  the  landslide  velocity 

V = y component  of  the  landslide  velocity 

w = water  velocity  in  the  z direction 

x,  y = rectangular  coordinate  variables 

3 = pressure  drag  parameter 

n = water  level  displacement  with  respect  to  still 

water  elevation 

C = vertical  bottom  deformation  created  by  landslide 
material 

p = density  of  water 
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ABSTRACT.  The  Euler- Maclaurin  formula  for  numerical  integration  is 

b 2k-l  hmB  . ..  . n 

f f(x)dx  = T ' - £ }(b)  - f m-1)(a)]  - 

a m-Z 

h^_(b-a)B  f(2k)(e)  , 

(2k)! 


where  T^  is  the  trapezoidal  rule 


h n_1 

Tn  = J [f(a)  + f(b)]  + h £ f(a  + ih)  , 

i=l 


h = (b  - a)/n,  B^,  B^,  . . . , B^^  are  Bernoulli  numbers,  and  a<  | < b . The 

application  of  this  formula  can  be  automated  by  using  software  developed  at 
MRC  for  analytic  differentiation  and  interval  analysis.  The  use  of  interval 
techniques  permits  rigorous  bounding  of  the  error  due  to  roundoff,  and  also 
the  truncation  error  by  calculating  an  interval  containing  f^2  ^(£)  . By  use 
of  observed  values  of  the  time  required  for  evaluation  of  the  integrand  and 
experimental  results  on  differentiation  time,  optimal  values  for  n,  k are 
calculated  to  give  a required  accuracy  in  minimum  time,  or  an  estimate  of 
the  ultimate  accuracy  of  the  Euler- Maclaurin  integration  method  may  be  com- 
puted. The  theory  is  illustrated  by  results  obtained  using  a UNIVAC  1108/1110 
program. 

AMS  (MOS)  Classifications  (1970):  65D30,  65G05. 

Key  Words:  Numerical  integration,  Euler-Maclaurin  formulas,  Automatic 
error  estimation,  Interval  integrals. 
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1.  BACKGROUND.  With  the  aid  of  software  for  interval  analysis  [4]  and  automatic 
differentiation,  various  methods  for  numerical  integration  with  rigorous  error 
estimation  [1,  3]  have  been  implemented  as  a program  for  the  UNIVAC  1108/1110  [2]. 
To  be  more  specific,  the  program  described  in  [2]  provides  the  user  with  the 
capability  of  finding  intervals  containing  the  value  of  the  integral 

b 

(1. 1)  2 = J f(x)dx 

a 

by  the  use  of  Riemann  sums,  various  open  and  closed  Newton-Cotes  integration 
formulas,  or  standard  Gaussian  integration  formulas.  The  latter  types  of  inte- 
gration formulas  (Newton-Cotes  and  Gaussian)  are  of  the  form 

(!•  2)  z = r(f)  + e(f)  , 

where  the  rule  r(f)  of  numerical  integration  is  a linear  combination 

n 

(1-  3)  r(f)  = Yj  f(x.  ) w 

i=l 


of  values  of  the  integrand  at  the  nodes  x 
and  the  (truncation)  error  (or  remainder ) term 


.,x,  with  weights  w , 

7 n r 2 n ’ 


(1.4) 


e(f)  = cn(a,  b) 


k ! 


a < % < b , 


is  a multiple  of  the  kth  Taylor  coefficient  of  f,  evaluated  at  some  point  £ in  the 
open  interval  (a,  b)  . In  (1.4),  the  constant  c (a,  b)  is  independent  of  f . A 
numerical  integration  formula  of  this  type  will  l?e  said  to  be  of  order  n and  degree 
k,  and  is  valid  for  integrands  which  are  sufficiently  smooth.  As  a typical  example, 
one  has  the  (extended)  trapezoidal  formula  [5,  p.  170] 


(1-  5) 


fbf(x)dx  = Salifla 

J n 2 


n -1  ~ 

i 

i=l  -> 


a < ^ < b , 


which  is  of  order  n + 1,  n a positive  integer,  and  degree  two. 

As  the  program  has  facilities  for  analytic  differentiation,  it  is  also  possible 
to  implement  formulas  for  numerical  integration  in  which  the  rule  involves  values  of 
derivatives  (or,  equivalently,  Taylor  coefficients)  of  the  integrand.  A class  of 
formulas  of  this  type  may  be  obtained  from  the  Euler- Maclaurin  formula  for  numerical 
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integration,  in  which  the  order  (>  2)  and  (even)  degree  may  be  specified  by  the 
user.  This  formula  and  its  implementation  for  automatic  computation  are  described 
in  the  following  sections. 

As  in  [2],  the  present  program  is  applicable  only  to  integrands  f(x)  which 
may  be  written  in  ordinary  FORTRAN  notation.  In  addition  to  the  variable  of 
integration,  the  integrand  may  contain  one  or  more  parameters  to  be  specified  by 
the  user. 


2.  THE  EULER- MACLAURIN  FORMULA.  Using  the  fact  that  the  Bernoulli  numbers 
B , B , B , . . . of  odd  order  > 3 all  vanish  [5,  p.  218],  the  Euler-Maclaurin 
integration  formula  of  order  n + 1 and  degree  2k  may  be  written  in  terms  of 
values  of  the  integrand  and  its  Taylor  coefficients  as 


(2.1) 


where 

(2.2) 


/ f(x)dx  = hff(a)^f(b)  + Yj  f(a  + ih)l 
a ^ i=l 

k-1  h2mB  r 
vo  Zm  1 

"ii,  2m  L* 


f(2m-^(b) 
(2m-l) ! 


f(2n-l)(a)"| 

(2m-l) ! J 


h2k  a)B  f(2k)l£L 

-h  (b-a)B2k  (2k), 


a < £,  < b , 


h 


b - a 
n 


It  should  be  noted  that  the  values  of  £ for  which  formula  (2.1)  holds  depend  in 
general  on  a,  b,  f,  k,  and  n;  however,  this  dependence  is  suppressed  for  simplicity 
of  notation,  and  will  be  inconsequential  in  the  interval  version  of  the  formula. 

For  practical  reasons,  the  computer  program  is  limited  to  calculation  of 
Taylor  coefficients  of  orders  less  than  twenty,  hence  the  maximum  degree  of  the 
integration  formula  (2. 1)  permitted  without  modification  of  the  program  is  2k  = 18  . 
The  even-order  Bernoulli  numbers  B^,  B , . . . , Bjg  are  given  in  Table  2. 1 [5, 

p.  218]. 
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B2 

1 

’ 6 » 

B4  = 

30  ’ 

B6  = 

1 

42  » 

B8 

1 

Bio  = 

5 

B12 

691 

" “ 30  ’ 

66  * 

“ 2730  ’ 

B14 

7 

B16  = 

3617 

B18  = 

43867 

6 ’ 

510  ’ 

798  ' 

TABLE  2.1.  THE  BERNOULLI  NUMBERS 


B?>  B<i>  * ■ ■ » B 


18 


The  first  term  of  (2. 1), 

(2.3)  tn  = yf)  = h + g f(a  + lh)1 

i=l  J 

is  simply  the  trapezoidal  rule  from  the  integration  formula  (1.  5).  Hence,  the  use 
in  the  second  term  of  (2. 1)  of  values  of  the  Taylor  coefficients  of  f(x)  at  the 
endpoints  a,  b can  be  viewed  as  a method  for  increasing  the  order  of  accuracy  of 
the  trapezoidal  integration  formula. 


3.  THE  INTERVAL  VERSION  OF  THE  EULER- MACLAURIN  FORMULA.  In  order  to 
make  use  of  the  integration  formula  (2,1)  and  take  into  account  the  effects  of 
round-off  error,  uncertainties  in  the  coefficients  of  f(x)  and  perhaps  the  limits 
of  integration,  and  the  unknown  value  of  §,  an  interval  extension  of  the  right- 
hand  side  of  (2.1)  is  calculated  by  the  program.  As  defined  in  [2],  an  interval 
extension  of  a set  of  numbers  is  any  interval  containing  that  set.  (Of  course, 
the  set  being  extended  could  consist  of  a single  number. ) Interval  extensions  of 
numbers  and  functions  will  generally  be  denoted  by  the  corresponding  capital 
letters,  exceptions  being  small  integers  and  the  Bernoulli  numbers.  Taking 

(3.1)  XD  [a,  b] 

and 


(3.2) 


H = 


B-A 

n 


9 
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one  computes  the  interval  version 


(3.  3) 


^ n i 

TT  F(A)  + F(B)  v 

n,k  = H Z + S,  F<fl  + lH>  - 

^ 1-1 


H2rn^mrF|2m-1)(B)  F*21"-1' 


m=l 


[' 


2m  I (2m-l)  ! 


■(2m-1)(A)'1 

(2m-l)  I J“ 


- H2k(B-A)B 


2k  (2k)  ! 


of  the  Euler-Maclaurin  integration  formula. 
For 


(3.4) 


z = J f(x)dx  , 


one  has  that 


(3.  5) 


Z € I 


n,  k 


for  all  positive  integers  k,  n . If  I_  u - [c,  d],  then  one  may  take  the  midpoint 

n^  k 


(3.6) 


* rx  i c+d 

Z - ^n,  d ~ 2 


as  an  approximation  to  z,  with  absolute  error 
(3.7) 


I-  - -*l  S, 


If  the  minimum  bound  for  the  relative  (or  percentage)  error  is  desired  instead, 

then,  provided  0 <|  I , , the  corresponding  estimate  for  z is  the  harmonic 

n,  k 1 1 ■ ■ 1 ■■ 

point 


(3.8) 


**  r.  , 2cd 
Z ‘ ^n,  d " c+d 
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with  relative  error 


(3.9) 


** 

z - z 

d - c 

z 

s P - 

c + d 

The  percentage  error  is,  of  course,  bounded  by  100p  . The  a posteriori  error 
bounds  (3.7)  and  (3.  9)  are  rigorous,  as  follows  from  the  theory  of  interval 
analysis  [3,  4]. 


4.  THE  INTERSECTION  PRINCIPLE  . As  the  computer  program  calculates  Taylor 

coefficients  recursively  [2],  the  values  of  I , I I , . can  be 

n,  I n,  l*  n, 

obtained  in  the  course  of  the  computation  of  I with  very  little  additional 
effort.  For  * 


(4.1) 


k 

i =n 

j=i 


9 


it  follows  from  (3.  5)  that 


(4.2) 


b 

Z = f f(X)  € I . 
a 


The  estimates 

(4.  3)  z*  = h[I],  e = ~ 6[I]  , 


or,  if  0 <j  I , 
(4.4) 


z 


tlfl], 


are  at  least  as  good  as  those  obtained  from  (3.  6)-(3.  9),  and  are  the  ones  actually 
calculated  by  the  program.  This  is  a simple  application  of  the  intersection  principle 
of  interval  analysis,  which  states  that  results  belonging  to  several  intervals  are 
contained  in  their  intersection. 

For  example,  for 
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(4.5) 


sin  x + tan 


dx 


> 


z 


ln(2  +eX) 


one  has 


(4.6)  Z€  Ij  l = [0.43180767,  1.  0839293]  , 

the  interval  form  of  the  trapezoidal  formula,  and 


(4.7) 


Z E 


[0.41804086,  0.  81363283]  , 


and  thus 


(4.8) 


Z € I 


V" 


I =[0.43180767,  0.81363283] 

lj  2 


> 


which  gives  a more  accurate  result. 


5.  OPTIMIZATION.  If  the  values  of  n and  k are  given  by  the  user  of  the 
Euler-Maclaurin  integration  program,  then  the  results  given  by  (4.  3)  or  (4.4)  are 
obtained  directly.  On  the  other  hand,  one  or  both  of  these  parameters  may  be 
determined  by  the  program  in  order  to  achieve  a prescribed  or  maximum  possible 
accuracy  with  the  use  of  the  minimal  amount  of  computational  effort.  This  type 
of  optimization  of  performance  of  the  program  makes  use  of  an  extension  of  the 
strategy  outlined  previously  for  k fixed  and  n arbitrary  [2,  pp.  12-16]  to  the 
case  that  both  the  order  and  degree  of  the  integration  rule  may  vary. 

First  of  all,  for  n fixed,  it  has  been  found  expedient  simply  to  compute 


(5.1) 


rU) 


J 

= n i . 

i=i  n-‘ 


until 


(5.2) 


j(i)  = jU+i)  = jU+2) 
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or  j = 9 - Once  the  optimal  value  j “ j is  determined,  one  sets 


(5.3) 


* 

i = i« ' 


and  obtains  approximate  values  for  the  integral  and  error  estimates  from  (4.  3) 
or  (4.4).  For  example,  for  the  integral  (4.  5), 


(5.4) 


I^2)  = [0.43180767,  0.  813632  83] 


is  optimum  for  n = 1,  while 


(5.  5) 


I*3*  = [0.  65189380,  0.66617152] 


is  optimum  for  n = 2,  and 

(5.6)  I(6)  = [0.  65887366,  0.65890718] 


is  the  optimal  value  for  n = 4 . 


In  the  second  case  of  fixed  k and  arbitrary  n,  the  procedure  is  similar 
to  optimization  of  integration  rules  of  degree  k done  previously  [2,  pp.  12-16]. 
In  order  to  discuss  the  analysis,  write 


(5.7) 


I = T + S 
, k n n, 


+ R 


n,  k 


* 


where 


(5.8) 


Tn  . h|-mb 


> 


the  interval  version  of  the  trapezoidal  rule, 


k-1  HZmB 


(5.  9) 


n,  k 


-■i 


2 m 


m=l 


2m 


I 


F(2m-1)(B) 

(2m-l)  ! 


F(2m"1)(A) 

(2m-l) ! J ’ 


the  correction  term,  and 
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<5-10>  Rn,k=-H2k(B-A)B2k  -W2-  * 

the  interval  remainder  term.  The  error  bound 


(5.11) 


J 6(1  k1  = i 6[Tn!  + ? 6[Sn,  k1  + 7 6[R  k1 


will  thus  depend  on  the  way  in  which  the  widths  of  the  intervals  (5.  8)-(5. 10) 
vary  with  n,  or,  more  conveniently,  as  functions  of 


(5.12) 


h = m. 

n 


The  quantity  6[T  ] turns  out  to  be  essentially  constant  as  a function  of  n,  as 
T is  the  average  of  interval  evaluations  of  the  integrand  f(x),  the  widths  of 
tliese  intervals  depending  on  round-off  and  imprecision  in  the  data  involved  in 
the  definition  of  f(x)  . Thus, 

(5.13)  r=6[T1] 


is  computed,  and  is  taken  as  a measure  of  the  potential  accuracy  of  the  numerical 
integration  procedure.  If  an  error  bound  e is  prescribed  such  that 


(5.14) 


this  accuracy  will  be  considered  to  be  unattainable,  and  the  program  will  print 
an  error  message  [2,  p.  55].  If  an  error  bound  e is  not  specified  in  advance, 
then  the  value  of  r is  used  to  obtain  an  estimate  of  the  maximum  number  p of 
decimal  places  of  accuracy  that  can  be  guaranteed  for  the  approximate  value  of 
the  integral.  One  takes  p to  be  the  largest  positive  integer  such  that 

(5.15)  10-P  > r , 


and  sets 


(5. 16)  e = 5 • 10 

as  the  desired  error  bound  [2,  p.  15].  In  the  program,  (5.16)  is  computed  if  the 
input  value  EPS  = 0 is  specified. 
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Define 


(5.17) 


B2m  f T^m 

2m  (2m-l)  ! 


F(Zm-1)|A) 
(2m-l)  ! 


m = 1,  2, ... , k-1  , 


and 


(5.18) 


= 6 [(B-A,B2k 


F(2k)(X) 

(2k)  ! _ • 


One  has 


(5.19) 


‘I'.klS* 


= r + 


rih 


+ r. 


2k-2 


k-1 


+ th 


2k 


and  thus 


(5.  20) 


i6["  5 


if  n is  chosen  large  enough  so  that 


(5.21) 


p(h)  < 2e  . 


In  order  to  conserve  computer  time,  the  optimal  solution  is  taken  to  be  the  smallest 
positive  integer  n for  which  (5.  21)  is  satisfied.  This  value  is  determined  by  a 
simple  iteration,  taking  into  account  the  fact  that  the  coefficient  t in  (5.19)  is 
ordinarily  much  larger  than  the  values  of  r,  r^,  . . . , r , • One  uses  as  an 
initial  approximation  the  least  positive  integer  n such  that 


(5.  22) 


n > 


5[X] 

2e-r 


7 


and,  if  necessary,  increases  n by  one  until  (5. 19)  is  satisfied  by  the  correspond- 
ing value  of  h . 
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In  the  final  case,  the  values  of  both  n and  k are  to  be  determined  in 
order  to  satisfy  the  error  bound  e with  minimal  computational  effort.  To  do 
this,  some  assumptions  are  made  concerning  the  dependence  upon  k of  the 
width 


(5.23) 


PM 


of  the  kth  interval  Taylor  coefficient  of  the  integrand  f(x),  and  the  time 


(5.24) 


-DM 


required  for  its  computation.  Knowing  these  relationships,  one  could  determine 
if  a given  accuracy  is  best  obtained  by  increasing  n or  k . 

A heuristic  argument  indicates  that  the  values  of  t.  are  given  by  the  sums 
of  an  arithmetic  progression,  while  the  0^  are  obtained  in  the  same  way  from  a 
geometric  progression.  These  observations  are  borne  out  experimentally,  as 
illustrated  in  Table  5. 1 for 


(5.25) 


f(x)  = 


sinx  4 tan  x 
ln(2  + eX) 


X = [0,  1] 


The  program  computes 


(5. 26) 


e0  = e[T 


,,..p 


F(A)  + F(B) 

2 


from  which  the  estimate 


(5.27) 


-if) 


is  obtained  for  9[T  ] . Similarly,  the  values  of  0 


Kf>]  - &] 


obtained  initially  for  k = 0, 1,  2 . Suppose  that  n^  denotes  the  least  value  of 
n satisfying  inequality  (5.  22).  The  corresponding  estimate  for  9[I  .]  is 

taken  to  be  nk> 
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k 

.p>] 

BSM 

0 

0. 130  sec. 

1.4809 

1 

0.259  " 

2. 0949 

2 

0.397  " 

3.  9133 

3 

0. 546  " 

6. 5944 

4 

0.701  " 

1.  1868  X 10 

5 

0.  865  " 

2. 1230  X 10 

6 

1.038  " 

3.  8380  X 10 

7 

1.220  “ 

7. 0895  X 10 

8 

1,409  “ 

2.6012  X 102 

9 

1.609  " 

2. 6012  X 102 

10 

1.817  " 

5.  1869  X 102 

11 

2.034  " 

1.  0605  X 103 

12 

2.261  " 

2.  2173  X 103 

13 

2.498  " 

4.7251  X 103 

14 

2,745  " 

1.  0231  X 104 

i r 

15 

3.  000  " 

2. 2448  X 104 

16 

3. 266  " 

4 

4. 977 1 X 10 

17 

3.541  " 

1.  1131  X 105 

18 

3. 828  " 

2. 5068  X 105 

19 

4.  124  " 

5.  6780  X 105 

TABLE  5.  1.  COMPUTATION  TIMES  AND  WIDTHS 
OF  INTERVAL  TAYLOR  COEFFICIENTS  OF  (5.  25). 
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(5.  28) 


> 


, . M 

n.  ,k  l 2 / 


for  the  additional  time  to  compute  T . Using  the  numbers 

nk 


(5.29) 


t_  B2k+2  62k//62k-l  ’ 


where 


(5.  30) 


an  estimate  n,  , for  n.  , can  also  be  obtained  by  using  the  value  of  t 
k+1  k+1 

in  (5.22)  in  place  of  t . Similarly,  one  estimates 


(5.  31) 


ek+l=36 


&1  1 

_ (2k)  ! J L (2k -1)  ! _ 


Now,  if 


(5.  32) 


6nk,  k < 6 k+1  + 1 2 | 6 O’ 


the  program  computes 


(5.  33) 


2k 

i = n i 


J=1  V1  ' 


otherwise,  the  above  process  is  continued  with  k replaced  by  k + 1 
until  (5.  32)  is  satisfied  or  k = 9 . 
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CANCELLATION  AND  ROUNDING  ERRORS 
J.  Barkley  Rosser  and  J.  Michael  Yohe 


1.  Definition  of  terms.  We  assume  that  all  real  numbers 
with  which  we  will  be  dealing  are  represented  on  the  calculating 
machine  in  floating  point  positional  notation  with  a fixed  radix.  If 
two  such  numbers  are  nearly  equal,  then  the  calculated  difference 
between  them  will  have  more  leading  zeros  than  either  of  the  original 
numbers.  This  will  usually  cause  no  trouble  if  it  happens  that  the 
numbers  are  exactly  representable  on  the  calculating  machine,  though 
even  in  this  case  imperfections  in  machine  architecture  can  result  in 
a botched  answer.  However,  in  the  usual  case,  when  both  numbers  are 
only  approximated  on  the  calculating  machine,  this  phenomenon  will 
commonly  cause  appreciable  relative  error  in  the  answer. 

As  most  real  numbers  can  only  be  approximated  on  a calculating  machine, 
this  danger  is  always  present,  and  steps  should  be  taken  to  minimize 
the  effect. 

Suppose  we  are  given  two  real  numbers  a and  b and  a binary 
operation  ° , and  suppose  also  that  we  have  a machine  M with  the 
corresponding  binary  operation  ° Suppose  further  that  we  have  a 
rounding  operation  p from  the  real  numbers  to  the  set  of  machine 
numbers;  we  shall  assume  that  p rounds  a real  number  to  its  closest 
approximation  on  the  machine. 

Sponsored  by  the  United  States  Army  under  Contract  No.  DAAG29-75-C-0024. 
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We  are  interested  in  how  well  the  machine  answer, 
p(a)  p(b),  will  approximate  the  true  answer,  a ° b:  that  is, 
we  wish  information  on  the  value  of 

(1.1)  |p(a)eMp(b)  - aobl  . 

There  are  actually  three  different  considerations  involved  in 
this  expression;  we  shall  call  them  machine  error  (or  error  due  to 
architectural  deficiencies  in  the  machine),  accumulated  error 
(which  is  due  to  performing  arithmetic  operations  on  approximate 
numbers),  and  rounding  error  (which  results  only  from  the  fact  that 
most  real  numbers  must  be  approximated).  We  shall  now  define  these 
types  of  error  precisely. 

Machine  error  is  the  amount  by  which  the  machine  result 
fails  to  be  the  best  possible  result  derivable  from  the  rounded  numbers 
being  given,  including  the  fact  that  the  result  must  be  rounded;  i.e.  , 
machine  error  is  defined  to  be 

(1.2)  |p(a)°Mp(b)  - p(p(a)»p(b))  | . 

In  a well-designed  machine,  (1.  2)  can  be  made  to  vanish;  see  [9]  for 
details.  Thus  this  type  of  error  is  completely  avoidable.  Unfortunately, 
it  is  often  not  avoided;  we  have  even  seen  a calculating  machine  on 
which,  for  a large  class  of  real  numbers,  a,  we  have  a 1 - a x 1 # 0 
in  spite  of  the  fact  that  p(a)  = a for  this  class! 
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Accumulated  error  is  the  amount  by  which  the  rounded  result 
of  performing  the  real  operation  using  rounded  operands  differs  from 
the  rounded  result  using  the  original  operands;  i.e.  , 

(1.3)  |p(p(a)°p(b))  - P(a°b)  I . 

In  general,  accumulated  error  can  not  be  avoided,  but  it  can  be 
controlled  to  a certain  extent  by  selecting  computational  formulas 
judiciously,  as  we  shall  see  in  the  sequel. 

Rounding  error  is  simply  the  amount  by  which  the  rounded 
result  differs  from  the  true  result,  i.e,  , 

(1.4)  lp(a°b)  - a°b  | . 

Rounding  error  must  be  regarded  as  totally  unavoidable  in  the  context 
in  which  we  are  working. 

We  observe  that 

ip  (a)  °MP(b)  - a°b  I 5 

|P(a)  °Mp(b)  - p(p(a)°p(b))  | + 

(1.5) 

+ |p(p(a)°  p(b))  - p(a»b)  | + 

+ |p  (a  °b)  - a »b  | . 
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Hence,  by  (1.  2)  - (1.4),  the  total  absolute  error  is  not  greater  than 
the  sum  of  machine  error,  accumulated  error,  and  rounding  error. 

Since  machine  error  is  avoidable,  we  shall  not  treat  it  here, 
even  though  it  will  be  exhibited  in  some  of  our  discussions:  we  shall 
concentrate  on  accumulated  error  and  rounding  error.  These  can  be 
lumped  together  and  called  "error  of  approximation";  a fuller 
discussion  of  approximation  generally  can  be  found  in  Yohe 
[10,  p.  10]  . However,  since  rounding  error  is  totally  unavoidable, 
the  major  thrust  of  this  paper  will  be  in  dealing  with  accumulated 
error. 

In  most  cases  the  relative  error  associated  with  an  instance 
of  accumulated  error  behaves  rather  predictably.  That  is,  the  relative 
error  of  a result  in  a case  of  accumulated  error  is  related  in  a rather 
straightforward  manner  to  the  relative  errors  in  the  operands.  In  one 
case,  however,  the  relative  error  can  grow  catastrophically;  the 
subtraction  of  two  nearly  equal  quantities  can  cause  the  relative 
error  to  grow  by  many  orders  of  magnitude.  This  particular  instance 
of  accumulated  error  is  so  serious  that  we  give  it  a special  name: 
cancellation  error. 

As  an  example,  consider 

(1.6)  ~ - it  • 

We  have 
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~ B 3.1428  57  1 428  57 


it  S 3. 1 41  59  2653590 
~ - ir  & 0.001  264489267  35. 

However,  suppose  the  calculation  is  done  on  the  8 -place  decimal 
machine  belonging  to  one  of  the  authors.  We  enter  the  approximations 

?z 

(1.7)  “ ss  3.1428571 

(1.8)  it  ar  3.1415927. 

Subtracting  these  gives 

(1.9)  -y  - it  s 0.001  2644  . 

The  value  of  (1.6)  is  calculated  by  the  machine  to  only  five 
significant  decimals,  of  which  the  last  is  incorrectly  rounded.  Thus 
we  have  lost  at  least  three  decimal  places  of  accuracy. 

The  cause  of  cancellation  error  is  the  fact  that  when  the 
calculating  machine  is  called  upon  to  subtract  two  nearly  equal 
numbers,  the  early  digits  cancel,  and  fewer  significant  digits  will  result, 
as  shown. 

Rounding  error  usually  occurs  as  soon  as  a number  is 
entered  into  the  calculating  machine.  Thus,  in  (1.7),  the  value  given 
is  too  small,  though  it  is  the  best  possible  on  an  8 -place  machine.  In 
(1.8),  the  value  given  is  too  large,  though  it  is  the  best  possible  on  an 
8 -place  machine.  When  one  performs  arithmetic  operations  on  such 
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approximate  numbers,  the  errors  already  present  can  accumulate. 

Thus,  in  (1.9),  not  only  are  we  giving  only  five  significant  digits  (due 
to  cancellation  error),  but  we  are  not  even  giving  the  five  best  digits. 

A more  accurate  five  digit  value  would  be 

22 

" 0.0012645  . 

The  additional  discrepancy  (getting  0.0012644  instead  of  0.0012645) 

was  due  to  accumulation  of  rounding  errors. 

Although  we  will  not  discuss  it  in  this  report,  we  should 

recognize  yet  one  other  source  of  error  in  using  a calculating  machine. 

This  is  numerical  instability,  which  arises  when  the  method  of  computation 

is  such  that  the  percent  error  at  one  step  is  multiplied  by  a constant  greater 

than  unity  in  making  the  next  step. 

If  one  can  use  double  or  triple  precision  arithmetic,  so  that  one 

has  a superfluity  of  digits,  losing  a few  digits  by  cancellation  errors  or 

rounding  errors  will  likely  do  no  harm.  However,  one  should  give  thought 

to  the  details  of  the  calculation,  since  occasions  can  arise  in  which 

double  precision  (or  even  triple  precision  !)  will  not  suffice  to  avoid 

serious  errors,  or  even  complete  nonsense. 

2.  Cancellation  errors.  As  noted,  cancellation  errors  arise 

when  one  subtracts  two  nearly  equal  numbers.  Addition  of  positive  numbers 

causes  no  appreciable  cancellation  errors.  Likewise  for  addition  of  negative 

numbers . Also , the  way  most  calculating  machines  are  built , multiplying  or  dividing 
two  numbers  produces  no  appreciable  cancellation  error  . 
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Many  times  when  one  has  a formula  involving  subtraction  that 
leads  to  cancellation  errors,  one  can  find  a mathematically  equivalent 
formula  for  which  there  is  no  cancellation  error.  A classic  illustration 
arises  in  solving  quadratic  equations.  Suppose  one  wishes  both  roots  of 

2 

(2.1)  x - 200x  +1  = 0, 

The  formula  for  the  roots  is 

-b  + *7  b^-4ac 
2a 

With  the  coefficients  shown,  this  gives 

200  * "7 40000  - 4 


n/ 40000  - 4 s 199.9899997499875  . 

To  get  the  larger  root,  we  enter  200  into  the  machine,  and 
the  best  possible  eight  digit  approximation  for  the  square  root,  namely 
199.99000.  When  we  add  and  divide  by  2,  we  get  199.99500,  which  is 
correct  to  eight  digits.  When  we  subtract  and  divide  by  2,  we  get 

0.0050000000  . 

To  eight  places,  the  answer  should  be 

0.0050001250  . 

Here  we  have  a cancellation  error. 
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In  this  case,  one  can  find  a mathematically  equivalent  formula 
for  the  roots  by  multiplying  both  top  and  bottom  of  the  quadratic  formula 
by 

-b  T nA)2  - 4ac  . 

This  gives 


2c 

-b  T*/b  -4ac 

Then  for  the  small  root,  we  have 


2 

200  + 199.99000 


0.0050001250  . 


As  a matter  of  fact,  the  situation  could  have  been  considerably 
worse.  Suppose  we  had  tried  to  get  the  roots  of 

x2  - 1 OOx  + 1 = 0 , 


This  gives  the  roots 


100  ± V 10000-4 
2 


We  have 

's/l0000-4  = 99.9799979996  . 

We  enter  100  into  the  machine  and  the  best  possible  eight  digit  approxima- 
tion for  the  square  root,  namely  99.979998.  As  the  calculator  is  an  eight 
digit  machine,  it  must  reduce  these  to  the  same  number  of  decimals  before 
adding  or  subtracting.  It  shows  100  as  100.00000;  to  get  the  other  number 
to  match,  it  truncates  the  final  8,  so  that  it  gives 


453 


100  4-  99.97999 
2 


99.989995  . 


To  eight  places  the  larger  root  should  be  approximated  by  99.989999. 

This  is  not  a cancellation  error.  This  is  poor  machine  architecture. 

We  should  note  that  some  very  large  and  very  expensive  calculating 
machines  make  the  same  sorts  of  blunders.  For  instance,  the  UNIVAC  1108 
and  1110  do  so.  Other  vagaries  of  calculating  machines  are  discussed  in 
Kahan  [6]  . A discussion  of  hew  to  improve  calculating  machines  in 
matters  of  this  sort  is  given  in  Yohe  [9] . 

It  is  fairly  common  that,  with  a little  ingenuity,  one  can  find 
for  a formula  that  produces  cancellation  error  another,  mathematically 
equivalent,  formula  that  does  not.  Thus,  in  Rosser  [1]  , we  had  the 
formula  numbered  (2.44) 


°„+1  ' *(an  ' bn>’ 


which  in  Table  2.6  did  not  even  give  the  right  order  of  magnitude  for 

c..  . However,  the  formula  turns  out  to  be  mathematically  equivalent 
11 

to  the  formula  numbered  (2.88) 


n+1  4a  . ’ 

n+1 

which  clearly  has  no  cancellation  error.  Or  in  the  same  report,  the 
formula  numbered  (2.95)  involved  a difference,  and  gave  disastrous 
results;  the  mathematically  equivalent  formula  numbered  (2.98)  involved 
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a sum,  and  gave  quite  satisfactory  results.  Of  course,  just  having  an 
algebraic  sum  won't  guarantee  avoidance  of  cancellation  error;  one 
summand  could  be  nearly  the  negative  of  the  other.  However,  the 
formula  numbered  (2.98)  involved  a sum  of  squares,  and  hence  could 
be  relied  on  not  to  give  a cancellation  error. 

So,  for  purposes  of  calculation,  close  attention  should  be  paid 
to  trying  to  replace  formulas  that  could  produce  cancellation  errors  by 
mathematically  equivalent  formulas  which  do  not.  Sometimes,  this  does 
not  seem  possible.  Thus  in  the  formula  numbered  (3.27)  in  Rosser  and 
Papamichael  [ 2] , we  are  undertaking  to  compute  the  recursively 
from 


i;  m 


E A(2r-D 
r N+2-r 


N - 

The  Ef  are  known,  with  £^2.4.  The  value  (-1)  B*DN  is  approximately 

1.5.  The  J ^ can  be  calculated  from  A ...  A . Now  the  A 
N+2-r  1 N N 

-*8 

decrease  in  absolute  value;  A2Q  = -10  . So  obviously  use  of  the 


formula  shown  will  have  to  involve  serious  cancellation  errors  for  the 


larger  values  of  N.  However,  we  were  not  able  to  find  any  other  way  to 
calculate  that  A^.  By  using  double  precision,  we  were  able  to  get  about 
two  significant  digits  for  A . To  have  gone  appreciably  further  would 
have  required  a triple  precision  calculation. 
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Incidentally,  this  also  disclosed  a clearly  marked  case  of 
numerical  instability.  If  one  works  out  the  coefficient  of  AN  on  the 
right  side  of  (2.  2),  it  is  of  the  order  of  3.  In  other  words,  whatever 
absolute  error  we  make  in  A^,  we  are  guaranteed  to  make  an  absolute 
error  about  3 times  as  large  in  an+v 

3.  Rounding  errors.  As  we  indicated,  rounding  errors 
originate  from  the  fact  that  usually  a calculator  can  hold  only  an 
approximation  to  a given  number.  When  operations  are  performed  on  these 
approximate  numbers,  the  errors  tend  to  accumulate.  A classic  treatise  on 
this  subject  is  Wilkinson  [ 3]  . 

Roughly  speaking,  if  one  multiplies  or  divides  two  numbers,  the 
percent  error  in  the  product  or  quotient  can  be  as  large  as  the  sum  of  the 
absolute  percent  errors  in  each  of  the  participating  numbers.  It  need  not 
be  that  large,  and  often  will  not  be.  When  it  comes  to  adding  or  subtracting 
two  numbers,  the  situation  is  much  more  complex.  In  subtracting,  the 
absolute  error  of  the  difference  might  be  expected  not  to  be  more  than  the 
sum  of  the  absolute  errors  of  the  participating  terms  (this  is  not  always  so) 
but  if  there  is  cancellation  error,  the  percent  error  could  make  a sensational 
jump. 

Thus,  in  our  early  example,  we  had  an  approximation  for  22/7 

with  about  (1.4  X 10  ^)%  error  and  an  approximation  for  it  with  about 
-6 

(1.5  X 10  )%  error,  but  got  an  approximation  for  (22/7)  - -nr  with  about 

(7. 1 X 10  ■*)%  error. 
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Even  In  addition  of  positive  numbers,  the  situation  is  murky. 

It  might  seem  obvious  that  if  one  has  approximations  a^,...,a^  on  the 
machine,  eachofwhichis  not  more  than  e percent  less  than  the  true  value, 
then  the  sum  a^+a2+.  . ,+a^  will  not  be  more  than  e percentless  than  the  sum 
of  the  true  values.  This  seems  merely  to  be  a special  case  of  the 
distributive  law  of  algebra.  However,  it  is  not  so.  See  page  11  of 
Yohe  [ 10] . 

Consider  the  divergent  series 
(3.1)  , + t +t  +t  + •••  • 

Let  us  propose  to  sum  this  (or  parts  of  it)  on  an  eight  decimal  digit 

calculating  machine  which  can  accept  or  handle  numbers  down  to  an 

underflow  limit  of  about  10  *°°.  Each  term  of  the  series  (down  to  the 

underflow  limit)  can  be  entered  on  the  calculator  to  within  e percent 

accuracy.  Suppose  we  do  this  successively,  and  add.  With  an  8 -place 

8 

decimal  calculating  machine,  after  fewer  than  10  terms  the  terms  will  be 
so  small  that  when  they  are  added  they  will  make  no  change  in  the  calculated 
sum.  From  that  point  on  the  calculated  sum  will  remain  the  same,  though 
the  true  sum  will  rise  gradually  into  the  hundreds  until  we  come  to  terms 
so  small  that  underflow  limitations  prevent  us  from  entering  them  into 
the  machine,  and  the  summation  is  perforce  terminated. 

Of  course,  this  is  a very  contrived  situation,  but  it  illustrates 
why  it  is  difficult  to  make  any  general  remarks  about  rounding  errors  for 
addition,  and  even  less  for  subtraction.  However,  in  general,  a group 
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of  numbers  to  be  added  will  not  be  too  disparate  in  size,  and  something 
like  the  distributive  law  can  be  invoked,  but  with  caution. 

Usually,  with  ideal  rounding,  the  errors  from  rounding  are 
reasonably  random.  This  tends  to  hold  down  the  accumulation  of  errors. 

A rather  simplistic  statistical  model  is  often  invoked;  see  Henrici  [ 4] , 
page  305,  ff.  Even  more  elaborate  discussions  can  be  found,  as  in 
Rademacher  [ 7] . For  the  simplistic  model,  the  argument  is  roughly 
as  follows.  If  an  error  of  6 could  occur  at  each  operation,  and  if  the 
error  should  occur  in  the  same  direction  each  time,  then  in  n operations 
a total  error  of  n6  would  accumulate.  However,  if  the  errors  should 
fluctuate  randomly  between  positive  and  negative,  then  in  n operations 
a total  error  of  k's/n  6 would  (usually)  occur.  People  even  try  to 
determine  a suitable  value  to  assign  to  k.  However,  this  is  hardly 
justified. 

Consider,  if  we  have  n positive  summands  all  about  the  same 
size,  and  all  too  small  by  6,  the  error  after-  n additions  should  be 
n6;  the  sum  will  also  be  about  n times  each  individual  term,  so  that 
the  percent  of  error  has  not  increased  (the  distributive  law  again). 
Actually,  the  way  most  calculators  work,  the  error  after  n additions 
could  be  much  more  than  nS  for  very  large  n.  Review  again  the  example 
with  the  divergent  series.  If  n is  large  enough,  one  can  reach  the  point 
where  adding  another  number  makes  no  change.  Then  the  error  increases 
much  faster  than  by  6 at  each  step.  However,  if  the  addition  is  properly 
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arranged  (see  discussion  below),  the  distributive  law  will  be  reasonably 
well  approximated,  and  the  error  will  indeed  be  about  n6,  to  be  com- 
pared with  a sum  about  n times  the  individual  summands. 

Suppose  that,  because  of  randomness,  the  error  is  claimed  to 
be  k'Vn  6 . Suppose  that  a value  has  been  assigned  to  k,  so  that  with 
the  given  value  of  n the  error  is  claimed  to  be  only  nfi/iO  . If  the 
summands  are  all  about  the  same  size,  the  sum  is  about  n times  an 
individual  term.  So  we  claim  an  accuracy  for  the  sum  ten  times  as  good 
as  for  the  individual  terms.  If  each  summand  was  entered  to  maximum 
accuracy  on  an  8 -place  calculator,  then  it  would  have  the  sum  correct 
to  9 places,  which  is  obviously  impossible. 

However,  there  are  many  cases  where  the  general  distribution  of 
errors  is  random  enough  that  the  predictions  of  the  statistical  theory  seem 
reasonably  well  fulfilled.  The  total  error  (in  some  sense)  grows  at  a 
rate  proportional  to  Vn  rather  than  to  n.  For  additional  exceptions, 
see  the  next  section. 

In  Crary  and  Rosser  [ 5] , power  series  coefficients  for  41  functions 
were  calculated  to  high  accuracy;  between  40  and  50  coefficients  were 
computed  for  each  function.  In  the  report,  these  coefficients  were  uniformly 
rounded  to  considerably  fewer  decimals  at  print  out  than  were  carried  in  the 
calculation.  Among  the  checks  of  accuracy  of  transcription,  the  sum  of 
the  coefficients  for  each  function  was  also  computed  to  high  accuracy. 
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For  each  function,  the  transcribed  and  rounded  coefficients  were  added 
to  see  how  their  sum  compared  with  the  true  sum  (see  page  48  of  Crary 
and  Rosser  [ 5] ).  The  results  are  shown  in  Table  1 . This  is  a reasonable 
approximation  to  a Gaussian  distribution. 


Sum  too  high  by 

Number  of  cases 

+ 4 

2 

+ 2 

3 

+ 1 

9 

0 

12 

-1 

9 

-2 

2 

-3 

4 

Table  1 . 

We  might  remark  that  in  summing  (3.1)  we  used  the  worst  possible 
procedure.  It  is  standard  doctrine  that,  in  summing  a series,  rounding  error 
will  accumulate  more  slowly  if  one  adds  starting  with  the  small  numbers . 

The  reason  for  this  is  that  if  one  adds  the  large  terms  first,  then  when 
a small  term  is  added  in,  one  must  truncate  it  or  round  it  to  bring  it  to 
the  right  number  of  decimals,  and  information  is  lost.  If  one  adds  several 
small  terms  together  first,  this  information  is  used  (in  part)  to  form  an 
accurate  partial  sum  of  a size  more  comparable  to  the  large  terms.  Still 
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better,  if  the  series  is  suitably  graduated,  would  be  to  keep  the  partial 
sums  more  nearly  of  a size  with  each  other.  In  the  series  (3.1),  this 
could  be  done  very  nicely  by  adding  the  terms  in  pairs,  then  adding 
the  pairs  in  pairs,  etc.  With  this  system,  one  could  possibly  get  a 
sum  of  any  part  of  the  series  (3.1)  to  about  the  full  accuracy  which  the 
machine  could  accomodate. 

If  one  wishes  to  calculate  the  determinant  of  a large  matrix, 
one  can  be  subject  to  both  cancellation  and  rounding  errors.  By  definition, 
the  determinant  involves  a large  number  of  differences,  so  that  there  is 
great  opportunity  for  cancellation  errors . It  involves  a large  number  of 
terms,  so  that  rounding  errors  can  accumulate  badly.  Closely  related  is 
the  problem  of  solving  a large  number  of  simultaneous  linear  equations. 
With  proper  techniques,  one  can  sidestep  some  of  the  difficulties. 
However,  it  remains  a problem  in  which  even  the  best  methods  can 
occasionally  fail  disastrously.  A very  full  discussion  is  given  in  Kahan 
[8]. 

4.  A particular  example.  Define 

(4.1)  £ = 0.000013516  (octal). 

Then 

(4.2)  e a 0.0000  44450  16384  (decimal)  . 
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Let  us  enter  e into  a calculating  machine  with  9 -place  octal 
accuracy  (floating),  compute  r = 1 - e,  and  then  erase  the  value 

of  e.  How  well  can  the  computer  recover  e from  r? 

As  an  8 -place  decimal  machine  is  nearly  equivalent  to  a 9 -place 
octal  machine,  we  will  write  much  of  our  discussion  as  though  foran8-place 
decimal  machine;  this  machine  is  assumed  to  be  designed  in  such  a manner  as 
to  truncate  prior  to  addition,  so  that  machine  error  also  occurs. 

The  first  thing  is  to  make  sure  we  subtract  e properly  from  1 . 
The  machine  will  hold  1 as 


1. 0000000, 

being  an  8 -place  machine.  To  subtract  e,  it  will  truncate  it  to 

0.0000444  . 

Subtracting  will  give  r = 0.99995560.  Already,  we  have  lost  valuable 
information. 

What  we  do  is  to  calculate 

r = -£)  + ■§. 

This  will  give 

(4.  3)  r £ 0.99995555  . 
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There  is  no  cancellation  in  the  denominator  of  (4.4).  So  we  should 
be  able  to  get  it  accurate  to  about  8 significant  digits.  So  then  (4,  4) 
should  give  e good  to  nearly  8 significant  digits ! 

Of  course,  one  has  to  be  smart  about  summing  the  denominator, 
as  discussed  near  the  end  of  the  previous  section.  We  carried  out  a 
number  of  calculations  on  the  UNIVAC  1108,  which  has  9-place  octal 
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accuracy  (floating).  In  each  case,  Just  to  compare  results,  we  summed 

the  denominator  of  (4.4)  in  three  different  ways:  summing  from  the  left, 

summing  from  the  right,  and  iterated  summing  by  pairs. 

The  first  question  is  about  the  calculation  of  r^.  The  analysis 

given  in  Wilkinson  [ 3]  applies  perfectly  to  this  calculation.  If  the 

rounding  is  biased,  the  accumulation  of  rounding  errors  could  be  so  severe 

N 

that  we  would  get  an  approximation  for  r correct  to  only  two  significant 
decimal  digits.  In  fact,  after  multiplying,  the  UNIVAC  1108  truncates  instead 
of  rounding,  thus  introducing  machine  error.  After  262,144  multiplications  by  r, 
we  got  r^  £*8,68  4x10  ^ , as  contrasted  with  the  more  accurate  value  of 

(4.5)  rN  s 8.696  3885  X lo'6 

(got  by  a double  precision  calculation).  However,  even  this  poor  value 
N 

would  give  1-r  with  an  error  of  only  2 units  in  the  8-th  decimal  digit. 

However,  one  cannot  expect  the  denominator  of  (4.4)  to  be  very 
good,  with  truncation  instead  of  rounding  at  each  step.  We  got  the 
results  shown  in  Table  2.  Considering  the  bias  in  the  rounding,  these 
are  not  too  bad. 


Methods  of  summing 
the  denominator 

e X 10  5 

true  value 

4.445  0164 

summing  from  the  left 

4.450  5777 

summing  from  the  right 

4.446  0810 

iterated  summing  by  pairs 

4.445  5325 

Table  2. 
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In  Yohe  f 9]  , it  is  shown  how  various  sorts  of  rounding,  including 

various  types  of  "ideal"  rounding,  could  be  accomplished  by  simple 

hardware  or  simulated  by  software  on  most  computers.  We  programmed 

the  UNIVAC  for  ordinary  rounding,  thus  eliminating  machine  error  as  defined 

by  (1.2).  Presumably  the  statistical  theory  should  have  some  validity  in 

N -6 

this  case.  Indeed,  we  did  far  better.  We  got  r s 8.696  3897  X 10 
So  we  have  full  accuracy  for  the  numerator  of  (4,  4).  We  got  the  results  shown 
in  Table  3.  It  appears  that  we  have  recovered  7 of  the  8 digits  of  e,  and 
are  off  only  3 units  in  the  8-th  place. 


Methods  of  summing 
the  denominator 

e X i05 

true  value 

4.445  0164 

summing  from  the  left 

4.445  3544 

summing  from  the  right 

4.445  01 77 

iterated  summing  by  pairs 

4.445  0167 

Table  3. 


As  long  as  we  are  looking  for  convenient  forms  for  the  calculation 

18 

of  e,  we  might  note  that  since  N = 2 , we  can  write 

?18 

1 - r ^ 

(4.6)  1 - r - . 

(1+r)(1+r2)(1+r4)(l+r^).  . . (1+r2  ) 
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This  involves  very  few  multiplications  and  additions,  and  should  be 
still  better.  However,  it  is  not.  It  gives  4.445  1465  as  an  approximation 
for  e X 10^  . 

The  reason  for  this  appears  if  we  give  a little  thought  to  the 
2 4 8 

matter.  We  computed  r , r , r , ...  by  successive  squarings.  If 

M M 

A = (1  + e)r  is  an  approximation  for  r , then 


a2  = 


(1  + 2e  + 


2.  2M 
e )r 


Of  course,  in  general,  A will  not  be  a number  that  can  be  stored  in 
the  machine.  The  result  of  multiplying  A by  A on  the  machine  will  be 
B , where 


So  the  approximation  B for 


B = (1  + 6)A2  . 
2M  .. 

r is  related  to 


2M 

r 


by 


B = (l  + 6)(1  + 2e+e2)r2M  . 

If  6 and  e have  opposite  signs,  B could  be  as  good  an 
approximation  for  r2^1  as  A was  for  r^,  possibly  even  better. 
However,  with  reasonable  randomness  of  rounding  errors,  there  will 
come  a time  when  6 and  e have  the  same  sign.  So  we  will  from 
time  to  time  get  a fairly  large  e,  and  eventually  the  2e  term  will 
become  overriding.  From  then  on,  we  essentially  double  the  percent 
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error  at  each  squaring.  This  is  an  example  of  numerical  instability, 
which  we  mentioned  earlier  as  a possible  source  of  error.  The  value  of 
rN  as  calculated  by  successive  squaring  on  the  UNIVAC  1108  with 
rounding  was  approximately  8.69  3 4161  X 10  6.  This  is  not  a whole  lot 
better  than  the  value  derived  by  262,144  successive  multiplications  by 
r with  truncation  at  each  step. 

5.  Rounding  error  depends  on  the  number  base  used  in  the 
calculator.  But  of  course!  However,  the  effect  can  be  much  greater 
than  one  might  expect.  For  example,  the  black  magic  in  the  previous 
section,  in  which  we  were  apparently  recovering  almost  eight  digits  of 
e from  1-e  = 0,99995555,  was  wholly  an  illusion  caused  by  the 
differences  in  rounding  between  a 9 -digit  octal  machine  and  an  8 -digit 
decimal  machine.  Recall  that  we  took 


£ = 0.0000  13516  (octal) 


exactly.  Then 


r = 0. 7777  64262  (octal) 


exactly.  So 


(5.1)  1-r  = 0. 0000  13516  0000  (octal) 
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exactly.  The  formula  (4.4),  when  applied  properly,  will  give  this 
value  to  nearly  9 significant  octal  digit  accuracy.  Converted  to  decimal, 
this  means  we  were  getting  close  to 

0.0000  4445  0164 

for  e;  the  illusion  was  created  that  we  were  recovering  nearly  8 
decimal  digits  of  e. 

As  another  illustration  of  a startling  divergence  due  to  differences 

N 

of  rounding,  recall  that  when  we  calculated  r by  successive  squaring, 
using  octal  rounding,  we  got 

8.693  4161  X lo"6 

as  compared  with  a more  accurate  value  of 

rN  ss  8.696  3885  X io"6  . 

However,  if  we  start  with  the  decimal  equivalent  of  r, 

r s 0.99995555, 

and  perform  successive  squarings  with  decimal  rounding,  we  get 

N 

r = 8.696  0578. 

If  used  in  (4.6),  this  would  hove  given 
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C £ 4.445  0305  x 10 

If  this  paper  has  a moral,  it  is  that  cancellation  and  rounding 
errors  are  more  serious  than  is  generally  believed,  especially  if  one 
runs  into  some  numerical  instability.  Mitigation  of  these  effects 
due  to  randomness  of  errors  is  not  as  trustworthy  as  one  might  believe 
from  the  statistical  theories  that  have  been  propounded.  However,  the 
remedy  used  by  many  people,  of  going  to  a double  precision  calculation 
If  any  suspicion  of  unreliability  appears,  will  probably  be  entirely 
adequate  in  all  but  an  extremely  small  minority  of  cases.  Nonetheless, 
even  with  double  precision,  it  is  worthwhile  giving  thought  to  whether, 
of  several  mathematically  equivalent  forms,  one  has  chosen  the  one 
least  likely  to  produce  cancellation  errors.  Also,  if  one  has  a sum  involving 
a large  number  of  summands,  it  is  worthwhile,  and  not  much  trouble,  to 
give  thought  to  the  best  order  of  performing  the  summation. 
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APPLICATIONS  OF  NUMERICAL  MODELING 
TO  COASTAL  ENGINEERING  PROBLEMS 
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ABSTRACT.  A review  of  the  numerical  modeling  efforts  within  the 
Wave  Dynamics  Division,  Hydraulic  Laboratory  at  the  Waterways  Experiment 
Station  is  presented.  Numerical  modeling  has  progressed  rapidly  in  the  last 
several  years  and  Is  now  generally  recognized  as  a useful  tool  capable  of 
yielding  valuable  information  on  alternative  solutions  to  many  coastal 
engineering  problems.  Present  efforts  include  models  for  application  to: 
Wind-driven  circulation;  storm  surge  on  a lake;  tsunami  generation  and 
transoceanic  propagation;  tidal  hydraulics  of  bays,  harbors,  and  inlets; 
generation  and  propagation  of  landslide,  generated  water  waves;  and  harbor 
oscillations,  both  free  and  forced.  A discussion  of  the  mathematical 
formulations  and  applications  Is  given  herein.  Future  efforts  are  dis- 
cussed which  include  stratified  lake  circulation  and  both  deep  and  shallow 
water  wind  wave  generation. 

1.  INTRODUCTION.  The  advent  of  large  scale  computer  systems  has 
made  it  possible  to  use  hydrodynamic  theory  based  upon  rational  physical 
approximations  rather  than  the  dictates  of  mathematical  tractability . 
Constraints  still  exist  due  to  computer  limitations  and  lack  of  basic 
understanding  of  certain  phenomena;  however,  numerical  modeling  has 
progressed  rapidly  in  the  last  several  years.  The  mathematical  model  is 
now  generally  recognized  as  a tool  capable  of  yielding  valuable  information 
on  the  particular  phenomenon  under  investigation. 

All  of  the  models  discussed  in  this  paper  are  based  on  long  wave 
theory  approximations.  There  are  many  problems  to  which  long  wave  theory 
can  be  applied  and  the  degree  of  validity  of  each  application  is  dependent 
on  the  particular  approximation  made  and  must  be  carefully  evaluated.  Long 
wave  theory  is  a valid  approximation  when  the  ratio  of  water  depth  to  wave 
length  is  small  and  the  vertical  component  of  motion  does  not  significantly 
influence  the  pressure  distribution  which  is  assumed  to  be  hydrostatic. 

A wide  range  of  applications  together  with  model  formulations  are 
presented.  It  is  not  the  intent  of  this  paper  to  present  detailed  descrip- 
tions of  models,  but,  instead,  to  indicate  the  level  of  effort  being 
undertaken  by  various  members  of  the  staff  at  the  Waterways  Experiment 
Station.  The  reader  is  asked  to  consult  the  appropriate  reference  to  obtain 
details  of  the  model  formulations* 


This  paper  was  presented  at  the  1975  Army  Numerical  Analysis  and 
Computers  Conference. 
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2.  SCOPE  OF  MODELING  EFFORTS.  Dynamic  prediction  equations  based 
on  the  fundamental  differential  equations  of  hydrodynamics  can  be  applied 
with  minor  variations  to  provide  data  on  wave  motion  where  the  depth  of 
water  is  small  relative  to  the  wave  length,  A linearized  version  of  the 
model  expressed  in  a spherical  coordinate  system  has  been  used  to  calculate 
the  propagation  of  tsunamis  (originating  from  seismic  disturbances  in  12 
segments  of  the  Aleutian  Trench)  from  their  source  to  the  west  coast  of 
the  United  States.  The  results  were  used  to  evaluate  the  relative  vul- 
nerability of  an  area  along  the  west  coast  to  tsunami  inudation  as  a 
function  of  seismic  location. 

A model  based  on  the  same  fundamental  equations,  but  cast  into  a 
Cartesian  coordinate  system,  has  been  applied  in  analyzing  the  tidal 
hydraulics  associated  with  harbors  and  inlets.  Additional  terms  including 
non-linear  inertial  forces  (convective  forces)  and  bottom  stress  were  con- 
sidered. The  numerical  tidal  model  has  been  used  as  a complementary  tool 
in  the  operation  of  a physical  model  of  the  Los  Angeles  and  Long  Beach 
harbor  complex.  Such  a numerical  model  can  provide  valuable  information 
for  use  in  the  design  and  operation  of  physical  models.  Some  of  the  areas 
to  which  the  numerical  model  can  be  of  assistance  are  documented  below: 

a.  Establishing  appropriate  boundaries  during  design  of  the  physical 
model  which  are  sufficiently  removed  from  the  influence  of  regions 
to  be  investigated. 

b.  Indicating  regions  where  data  collection,  may  be  more  critical  or 
desirable. 

c.  Indicating  possible  problem  areas  in  advance  of  physical  model 
tests. 

d.  Providing  assistance  in  the  general  interpretation  of  physical 
model  results  for  certain  phenomena. 

e.  Providing  a "quick  look"  at  a number  of  possible  configurations 
indicating  the  more  promising  plans, 

f.  Providing  a mechanism  for  investigating  certain  phenomena  such 

as  wind  which  cannot  be  easily  introduced  into  the  physical  model. 

The  integration  of  the  numerical  and  physical  wave  models  allows  for  a more 
complete  and  comprehensive  test  program. 

Under  a project  entitled,  "General  Investigation  of  Tidal  Inlets"  an 
attempt  has  been  made  to  construct  an  idealized  inlet  in  order  to  investi- 
gate the  effects  of  inlet  geometry,  friction,  and  bay  and  tidal  heights. 

An  effort  was  undertaken  to  investigate  the  ability  of  the  numerical  model 
to  reproduce  localized  velocity  patterns.  The  results  bore  out  the  fact 
that  many  details  of  the  velocity  patterns  seen  in  the  physical  model  were 
reproduced  in  the  numerical  model.  Applications  to  actual  inlets  have 
been  made  and  comparisons  with  prototype  elevation  and  velocity  data 
obtained . 
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With  minor  variations  the  tidal  model  has  been  applied  in  the  invests 
gation  of  landslide  generated  water  waves.  In  1971,  the  hydraulics 
Laboratory  at  WES  constructed  a 1:120  undistorted  scale  model  of  Libby 
Dam  and  Lake  Koocanusa,  Montana,  and  conducted  tests  to  determine  wave 
heights,  runup  and  amount  of  overtopping  resulting  from  the  sliding  of 
individual  rock  ribs  into  the  reservoir.  Subsequently  a study  was  initi- 
ated at  WES  and  directed  toward  developing  numerical  methods  for  predicting 
the  effects  oi  landslide  generated  water  waves  in  reservoirs.  The  agree- 
ment between  the  physical  and  numerical  models  was  very  good  considering 
that  each  set  of  results  (five  separate  slides  with  different  slide 
velocities)  were  subject  to  certain  inherent  limitations  imposed  by  the 
model  theory  and  others  imposed  by  lack  of  knowledge  concerning  prototype 
conditions  or  how  to  model  certain  phenomena. 

The  Lake  Erie  Regional  Transportation  Authority  is  conducting  a 
feasibility  and  site  selection  study  for  a major  hub  airport  in  the 
Cleveland  service  area.  One  of  the  possible  sites  being  evaluated  is  an 
offshore  site  near  Cleveland,  Ohio.  As  a part  of  the  feasibility  analysis 
of  an  offshore  site,  WES  is  conducting  a model  feasibility  investigation 
and  a part  of  this  study  is  to  determine  the  effects  of  the  jetport  on 
the  hydrodynamics  of  the  surrounding  area.  Two  of  the  numerical  models 
applied  so  far  in  the  project  have  been  used  to  define  the  effects  of  the 
jetport  on  the  following  phenomena: 

a.  Near-field  and  far-field  definition  of  the  wind-driven  cir- 
culation for  constant  density,  well  mixed  lake  conditions. 

b.  Storm  surge  at  the  shoreline,  from  Lorain  to  Fairport.,  Ohio. 

For  many  coastal  engineering  problems,  particularly  in  the  design  of 
harbors,  the  capability  of  estimating  the  response  of  partially  enclosed 
bodies  of  water  such  as  harbors,  bays,  etc*,  to  long  wave  excitation  can 
be  a useful  tool  for  preliminary  investigations  as  well  as  a guide  to 
physical  model  studies.  The  geometric  shape  of  the  harbor  and  the  wave 
reflections  associated  with  its  boundaries  produce  amplification  or 
attenuation  of  the  incidence  wave  regime.  This  phenomenon  is  of ter  refer- 
red to  as  harbor  resonance  or  seiching  and  can  excite  adverse  motion  of 
moored  ships  and  produce  currents  which  may  be  hazardous  to  navigation. 

New  harbors  or  modif ications  to  existing  harbors  can  be  designed  to 
minimize  the  effects  of  harbor  resonance  by  mismatching  the  frequencies 
at  which  a harbor  has  maximum  response  and  the  frequencies  of  the  pre- 
dominant long  wave  energy  which  is  characteristic  of  the  wave  regime  at 
the  harbor  site. 

The  major  problem  is  how  to  determine  accurately  the  amplitude 
distribution  function  and  frequencies  of  the  resonance  characteristics 
of  a harbor.  One  approach  is  the  application  of  numerical  techniques. 

Two  methods  have  been  applied  at  WES: 
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a.  A finite  element  method  for  obtaining  the  two-dimensional 
amplitude  distribution  and  frequencies  of  undamped,  natural 
modes  of  oscillations. 

b*  The  solution  of  a boundary  value  problem  for  long  wave- 
induced  oscillations  (forced  oscillations). 

These  methods  were  applied  to  the  harbor  at  Port  Hueneme,  California,  in 
conjunction  with  a physical  long  wave  model  study  of  the  harbor*  An 
additional  application  of  the  free  oscillation  model  was  made  to  Lake  Erie 
in  order  to  determine  the  effects  of  a jetport  structure  on  the  free 
oscillations  of  Lake  Erie. 

3.  TSUNAMI  MODEL*  A computational  scheme  relating  arbitrary  sea- 
floor displacements  and  the  consequent  surface-wave  history  was  developed 
by  Hwang,  Butler,  and  Divoky  (1972).  In  order  to  permit  more  reliable 
estimates  of  far-field  surface  displacements  as  well  as  ease  of  including 
the  effects  of  the  earth's  curvature,  the  problem  was  cast  into  a spherical 
grid  covering  the.  area  of  interest. 

The  governing  equations  can  be  written  as: 

Momentum  Equations 
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where  the  spherical  coordinate  system  is  described  in  Figure  la. 


To  solve  these  equations  an  alternating  direction  technique  is 
employed  similar  to  that  of  Leendertse  (1967).  A space-staggered  scheme, 
is  used  in  which  velocities,  water  level,  and  depth  are  described  at 
different  points  within  a grid  cell  (Fig,  lb).  The  first  half  cycle  of 
the  calculation  consists  of  computing  U and  rj  implicitly  and  V 
explicity,  advancing  from  time  nAt  to  (n  + %)  At.  The  second  half  cycle 
computes  n and  V implicitly  and  U explicitly,  advancing  from  time 
(n  + h)  At  to  (n  + 1)  At. 


By  continuing  these  processes  one  can  determine  the  values  of  U , 
V , and  n as  functions  of  time.  The  finite  difference  equations 
involved  in  the  first  half  cycle  of  the  calculation  are: 
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Fig.  la.  Spherical  coordinate  system 


Q WATER  DEPTH  h 

X WATER  LEVEL  V 

— u VELOCITY  IN  THE  DIRECTION  9 

| v VELOCITY  IN  THE  OtRECTION  <f> 

Fig.  lb.  Space-staggered  grid  definition 
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These  equations  can  be  solved  for  U and  T)  ~ where  terms  noted  with  a 
bar  are  computed  by  averaging  values  of  h and  n to  determine  the  barred 
quantities  at  the  specific  points.  The  explicit  expression  for  vn^  is 
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The  difference  equations  for  the  second  half  cycle  are  similar  with  IJ 
and  V exchanging  roles. 


Two  types  of  boundary  conditions  are  involved  in  the  computations. 
These  are  the  solid  boundaries  at  coastlines  and  the  open  sea  boundaries 
arising  from  the  need  to  truncate  the  region  of  computation  to  minimize 
computer  core  and  processing  time  requirements.  A condition  of- complete 
reflection  is  adopted  at  solid  boundaries.  No  dissipative  factor  is 
considered  in  the  model  at  present  to  account  for  loss  of  tsunami  energy 
due  to  shoreline  interaction.  This  condition  is  accomplished  by  setting 
U=0  or  V-0  at  the  appropriate  boundary. 


The  open  sea  condition  must  simulate  a total  transmission  of 
the  wave  through  the  boundary.  This  cannot  be  rigorously  achieved  without 
computation  beyond  the  boundary  and  thus  an  artifice  is  adopted  which  assumes 
that  the  wave  profile  travels  without  change  of  form  across  the  last 
interior  cell  of  the  grid  at  the  shallow-water  wave,  speed.  This  leads  to 
the  expression 
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where  B refers  to  an  exterior  face  of  a boundary  cell,  B-l,  the  cor- 
responding interior  face  of  the  same  cell,  and  AS  the  dimension  of  the  cell 
normal  to  the  boundary  face. 

The  derivative  of  the  ground  motion  term  appearing  in  the  continuity 
equation  accounts  for  the  time  dependent  vertical  bottom  displacement 
measured  from  a pre-quake  bottom  topography.  If  far~field  results  are 
the  major  interest,  an  initial  water  surface  deformation  is  assumed  instead 
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of  allowing  the  numerical  code  to  simulate  the  time  dependent  vertical 
motion.  This  initial  surface  condition  conforms  to  the  topographic  features 
of  the  ocean  floor  which  experiences  permanent  vertical  displacement 
resulting  from  a quake  disturbance.  Since  the  characteristic  time  for 
vertical  ground  movement  is  small  when  compared  to  the  long  periods  of 
tsunami  waves,  it  is  sufficient  to  assume  an  instantaneous  uplift  over 
the  entire  disturbance  area. 

The  tsunami  model  was  verified  by  bind casting  the  1964  Alaskan 
Tsunami  (Hwang,  et  al ; 1972).  With  observations  of  generated  bottom 
displacements  and  distant  wave  histories  available,  the  computed  and 
observed  waves  compared  well  in  amplitude  and  phase  of  the  dominant  lead- 
ing wave , 

Results  of  the  Aleutian  Trench  study  (Whalin,  Garcia,  and  Butler; 

1974)  indicate,  that  the  tsunami  generated  by  the  1964  Alaskan  quake  may 
not  be  a rare  event  at  all.  Further,  a tsunami  of  equal  intensity  result- 
ing from  a seismic  disturbance  located  elsewhere  in  the  Aleutian  Trench 
could  cause  significantly  higher  runup  at  particular  west  coast  locations. 
Figure  2 displays  a sample  plot  of  surface  elevation  contours  at  a time 
of  4 hours  following  a hypothetical  disturbance  in  segment  7 of  the 
Aleutian  Trench  (a  source,  location  south  of  Shumagin  Island). 


it*  TIDAL  MODEL.  A numerical  tidal  model,  based  upon  the  original 
formulation  by  Leendertse  (1967),  has  been  applied  at  WES  to  analyze  the 
tidal  hydraulics  of  harbors  and  inlets.  Again  the  fundamental  equations 
are  averaged  over  the  vertical  dimension  and  expressed  in  a Cartesian 
coordinate  system  to  yield 


Momentum  Equations 
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where  the  stress  terms  are  defined  as 


Bottom  stress 
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The  major  assumptions  involved  along  with  the  consequence  of  each 
assumption  are  outlined  below: 


a.  Internally  the  fluid  is  inviscid  (horizontal,  and  vertical 
eddy  viscosity  terms  are  considered  to  be  negligible) . 

b.  The  fluid  is  incompressible  (continuity  equation  and  gravity 
term  are  simplified), 

c.  The  vertical  component  of  the  fluid  acceleration  is  negligible 
(momentum  equation  in  the  vertical  direction  reduces  to  the 
hydrostatic  pressure  variation), 

d.  The  fluid  is  well  mixed  so  that  the  variations  of  flow  in 
the  vertical  direction  are  small  (allowing  the  averaging  of 
flow  quantities  over  the  vertical). 

To  solve  the  above  system  of  equations  numerically  the  differential 
equations  must  be  discretized  and  replaced  by  a.  system  of  finite  difference 
equations  using  central,  differences  on  a space-staggered  grid  as  in.  the 
c a s c with  the  t sunami  mod el*  Ag a in 4 Le end er t s e 1 s implicit- exp 1 i c i t mu ] t i- 
operational  method  is  employed  in  determining  the  solution  for  U , V * 
and  n as  functions  of  time. 

The  difference  equations  for  the  second  half  cycle  are  written  as: 
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These  equations  are  solved  for  Vn+'*'  and  rin+l  along  a grid  line  j 
For  simplicity  of  notation  some  terms  have  been  maintained  in  differential 
form  within  angle  brackets  < >.  Central  differences  are  used  in  evaluate 
ing  these  terms.  The  additional  velocity  component  {jn+l  can  be  determined 
explicitly  from  the  equation 
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(16) 


The  expressions  for  the  forcing  terms  F and  F are  written  as 
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In  addition  the  terms  signified  by  a single  bar  are  not  defined  at  the 
point  where  the  equation  is  written  and  are  determined  by  averaging  two 
neighboring  values.  The  terms  signified  by  a double  bar  require  four 
neighboring  values  to  obtain  an  average. 


Three  types  of  boundaries  are  involved  in  the  calculations: 

a.  A condition  of  complete  reflection  is  considered  at  solid 
boundaries . 

b . Tidal  elevations  are  input  data  at  artificial  ocean  boundaries. 

c.  Tidal  elevations  or  average  fluid  velocities  are  input  data 
at  artificial  boundaries  in  the  inlet,  harbor  or  bay. 

Sufficient  prototype  data  must  be  available  to  operate  and/or  verify 
the  model.  Along  these  lines,  it  should  be  emphasized  that  tidal  elevations 
or  average  fluid  velocities  must  be  defined  at  all  artificial  boundaries 
in  the  inlet,  harbor  or  bay. 


From  a consideration  of  the  assumptions  involved  in  the  development 
of  the  model,  it  is  apparent  that  it  should  be  most  applicable  for  consider- 
ing long  period  waves  in  non-st ratified  regions  where  vertical  accelerations 
are  small.  The  model  provides  no  indication  of  the  vertical  distribution 
of  velocity.  Phenomena  which  depend  on  the  detailed  velocity  distribution 
as  a function  of  the  depth  cannot  be  investigated* 

As  mentioned  previously,  applications  of  the  model  at  WES  include 
the  Los  Angeles-Long  Beach  Harbor  area  and  an  idealized  inlet  model. 

Figure  3 illustrates  the  tidal  velocity  field  for  a portion  of  the  LA- LB 
harbor.  The  treatment  of  breakwaters  and  piers  on  piles  are  of  particular 
interest  in  this  application.  The  frictional  coefficients  were  varied  to 
simulate  the  resistance  of  these  structures  to  the  flow.  Approximately 
1600  grid  points  were  used  in  the  computational  grid. 

5.  LANDSLIDE  MODEL,  A numerical  model  for  the  generation  and 
propagation  of  landslide  generated  water  waves  in  reservoirs  was  developed 
at  WES  (Raney  and  Butler;  1975)  as  part  of  an  effort  directed  toward 
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Illustration  of  a tidal  velocity  flow  field  for  a portion  of 
the  Los  Angeles-Long  Beach  Harbor  Complex 
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finding  analytical  and/or  numerical  methods  for  predicting  the  effects 
of  these  waves.  It  was  decided  that  a promising  initial  approach  for  the 
investigation  involved  using  a depth  averaged  formulation  of  the  fluid 
mechanics  equations. 


The  tidal  model  as  discussed  previously  can  be  adopted  with  appropriate 
modifications  to  evaluate  the  arrival  time  and  height  of  the  first  wave 
crest  which  may  result  from  a landslide.  Results  from  the  physical  model 
study  conducted  at  WES  (Davidson  and  Whalin;  1974)  indicated  that  in  most 
cases  the  first  wave  crest  was  the  largest  measured  at  each  of  the  model 
gauges.  The  pertinent  modifications  to  the  tidal  model  are: 

a.  A vertical  bottom  deformation  term  representing  the  passage 
and  settling  of  landslide  material  is  introduced  into  the 
continuity  equation  in  a manner  similar  to  the  ground  motion 
term  appearing  in  the  tsunami  model. 

b.  The.  acceleration  effect  of  the  landslide  on  the  fluid  with 
which  it  is  in  contact  is  introduced  into  the  momentum 
equations  as  part  of  the  forcing  terms. 

Thus  the  equations  can  be  written  as: 


Momentum  Equations 


(19) 

(20) 


9JJ  + 9U  9n  _ 

9t  + dx  + dy  + x + x 


8V  4.  ,i9V  W3V  9n 
9t  + ^ + V3y  + 


R + L 

y y 


(21) 


Continuity  Equation 

I?  + h { (h+n)u  }+  h { (h+n)v  }=  It 


The  method  of  solution  is  identical  to  that  used  in  the  tidal  model. 


Boundary  conditions  are  similar  to  those  adopted  in  the  tsunami  model. 
The  fictitious  open  boundaries  arise  from  the  need  to  truncate  the  region 
of  computation  (in  order  to  minimize  computational  time  requirements). 

Also  considered  is  a time  dependent  boundary  between  the  landslide  surface 
and  the  water  in  the.  reservoir. 


The  landslide  is  represented  by  a time  dependent  deformation  of  the 
bottom  of  the  reservoir  plus  additional  terms  to  represent  the  effect  of 
the  landslide  on  the  reservoir  due  to  viscous  and  inertia  forces.  The 
bottom  deformation  propagates  into  specified  regions  of  the  reservoir  at 
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the  average  speed  of  the  landslide  with  the  deformation  at  any  particular 
location  increasing  from  zero  to  a maximum  value  according  to  a specified 
time-displacement  relationship.  For  those  .portions  of  the  bottom  of  the 
reservoir  through  which  the  landslide  passes  but  which  do  not  experience 
a net  change  in  ground  elevation  as  a result  of  the  final  slide  deposi- 
tion, the  deformation  is  allowed  to  return  to  zero  at  a specified  rate. 

The  handling  of  the  landslide  condition  is  illustrated  in  Fig.  4*  The 
direction,  extent,  and  magnitude  of  the  bottom  deformation  is  determined 
by  knowledge  or  assumptions  concerning  the  path  and  final  disposition  of 
the  particular  landslide. 

The  water  in  the  reservoir  experiences  an  acceleration  due  to  the 
force  exerted  by  the  landslide  at  the  time  dependent  boundary  between  the 
water  and  the  landslide*  This  force  per  unit  mass  is  considered  to  consist 
of  a component  due  to  the  displacement  of  the  water  by  the  slide  plus  two 
components  which  act  on  fluid  elements  in  contact  with  the  landslide: 

L = a(v  - u)2  + B (v  - u)2 

X X X 

L = a(V  - V)2  + 3 (V  - v)2 

y y y 


- the  x component  of  the  landslide  velocity 

- the  y component  of  the  landslide  velocity 

= viscous  drag  parameter 
= pressure  drag  parameter 

‘ first  component  of  L and  L is  related  to  the  viscous  drag 

>y  the  slide  upon  the  fluid  with  which  it  is  in  contact.  The 

second  component  of  L and  L expresses  a pressure  drag  exerted  on  the 
water  by  the  front  ofXthe  sl¥de*  The  viscous  drag  is  considered  to  act 
at  all  points  of  contact  between  the  slide  and  the  water*  The  pressure 
drag  acts  only  at  the  leading  face  of  the  landslide. 

The  landslide  is  thus  represented  numerically  by  a combination  of 
terms  whose  net  effect  in  the  numerical  model  should  be  a reasonable 
representation  of  the  large  scale  physical  effects  produced  by  a landslide 
entering  a reservoir*  A representation  of  the  landslide  propagates  into 
the  numerical  representation  of  the  reservoir,  accelerating  the  fluid  due 
to  physical  displacement,  viscous  effects,  and  pressure  drag  effects.  The 
resulting  waves  propagate  across  the  reservoir  in  accordance  with  the 
governing  equations.  The  wave  height  and  velocity  components  are  calculated 
for  each  grid  cell  at  the  end  of  each  time  step. 
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With  a jetport  structure,  5 miles  offsh 

torizontal  velocity  field  in  the  Clevela 
at  the  surface 


a.  A finite  element  method  (FEM)  used  to  find  the  natural, 
oscillation  solution  for  an  open  or  closed  basin,  and 

b „ a method  to  estimate  the  two- dimensional  amplitude 

distribution  function  of  wave-induced  (forced)  oscil^ 
lations  in  harbors  of  complex  geometry  as  well  as  the 
wave  frequencies  at  which  maximum  harbor  response  occurs. 

In  theory,  both  types  of  oscillations  can  be  mathematically 
represented  by  boundary  value  problems  formulated  from  the  long  wave 
equations  and  appropriate  boundary  conditions. 

The  Fin i te  E 1 ement  Method 

Assuming  a periodic  form  of  the  solution  of  the  linear  long  wave 
equation  for  a variable  depth  basin,  the  governing  differential  equation 
for  undamped  harmonic  oscillations  of  a variable  depth  basin  becomes 


3 x 


8y 


<32>  {h(x,y)  MJSULl  ) + ^ (b(x,y)  ) + A2n(x,y)  - 0 

where 
(33) 


'2  2, 

A = 03  /g 


The  boundary  condition  for  this  problem  is  that  the  normal  component  of 
velocity  at  the  basin  boundary  be  equal  to  zero.  Since  equation  (32) 
(special  form  of  the  Helmholtz  equation)  is  the  equation  for  a standing 
wave,  this  boundary  condition  can  be  expressed  as 


(34) 

where 


= o 

8n 


n - unit  normal  to  boundary 

For  the  open  basin  case  the  boundary  along  the  basin  opening  cannot  be 
satisfied  by  the  above  boundary  condition  since  this  is  not  the  physical 
case.  The  surface  elevation  along  such  an  opening  is  assumed  to  be  a 
nodal  line.  Thus,  the  boundary  condition  along  the  open  boundary  is 

(35)  n(x,y)  = 0 

Therefore,  equations  (32-35)  are  the  partial  differential  equation  and 
the  boundary  conditions  for  the  boundary  value  problem  of  the  natural 
oscillations  of  a body  of  water.  It  can  be  shown  that  the  x and  y 
components  of  velocity  (u,v)  are  proportional  to  the  x and  y com- 
ponents of  the  horizontal  gradient  of  surface  elevation. 
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Solutions  to  this  boundary  value  problem  for  basins  of  complex 
geometry  and  variable  topography  have  been  estimated  numerically  by  Abel 
(1970)*  He  used  a calculus  of  variations  approach  and  obtained  an  Euler- 
Lagrange  formulation  of  the  boundary  value  problem.  The  variational 
formulation  for  the  boundary  value  problem  basically  consists  of  the 
functional  integral 

(36)  X = ///i{ h(x,y)  [^(x,y)]2  + h(x>y)  - A2n2(x,y)}dxdy 

which, for  a stationary  value  of  x (maximum  or  minimum)  with  respect  to 
n(x,y),  the  Euler-Lagrange  condition  of  the  functional  is  identically 
equal  to  equation  (32).  In  addition  to  this  functional  integral,  the 
Euler-Lagrange  formulation  possesses  appropriate  boundary  conditions. 

The  solution  n(x,y),  which  maximizes  or  minimizes  equation  (36),  is 
obtained  by  utilizing  finite-element  techniques.  Details  of  these 
techniques  can  be  found  in  most  finite-element  texts  (i.e.,  Desai  and 
Abel,  1972)  and  will  not  be  pursued  here.  Concisely,  the  finite-element 
technique,  as  utilized  in  this  problem,  is  a numerical  approximation  pro- 
cedure for  evaluating  the  integral  equation  obtained  from  maximizing  the 
functional  integral.  The  surface  area  of  the  study  region,  A , i.s 
divided  into  N subregions.  A*  (elements)  with  e = 1,  . . . , n,  with  the 
shape  of  the  element  (triangle  or  quadrangle)  being  defined  by  a number 
of  nodes  (vertices)  which  connect  the  element  to  other  elements  of  the 
grid  * The  integral  equation  is  expressed  and  applied  at  the  element 
level  and  later  assembled  for  the  total  domain  of  interest.  This  procedure 
produces  a system  of  M (total  number  of  node  points)  linear  algebraic 
equations  in  terms  of  the  M values  of  nm  at  the  nodal  points  and  in 
which  the  values  of  nm  form  the  solution  of  equation  (32).  This  system 
of  equations  can  be  expressed  in  the  form  of  an  eigenvalue  problem  and 
solved  by  matrix  procedures  for  the  eigenvalues  (natural  frequencies  of 
the  system)  and  the  eigenvectors  (configuration  of  relative  magnitudes 
of  surface  displacements  at  each  nodal  point). 

Forced  Oscillation  Method 


The  boundary  value  problem  for  long  wave-induced  oscillations  in 
arbitrary  shaped  harbors  of  constant  depth,  h , is  governed  by  the 
following  differential  equation 


(3H  ^ + a2!^  + k2n(Xjy)  , 0 

3x  9y 

which  is  called  the  Helmholtz  equation , where  the  wave,  number  k is  defined 
as 

(38)  k = co//gh 

* WES  internal  Memorandum  for  Record. 
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Lee  (1969)  in  mathematically  solving  this  boundary  value  problem 
divided  the  area  of  interest  into  two  regions,  a harbor  region  and  an 
infinite  ocean  region,  having  the  same  constant  depth,  h . For  each 
region,  the  Helmholtz  equation  is  the  governing  differential  equation- 
therefore,  the  problem  in  each  region  is  to  determine  the  wave  function 
r](x,y)  which  satisfies  this  differential  equation  and  the  appropriate 
boundary  conditions.  In  the  ocean  region,  the  solution  of  the.  wave 
function  is  assumed  to  be  composed  of  three  separate  parts: 

a.  An  incident  wave,  r).. (x,y) 

b.  A reflected  wave,  n <x,y),  from  the  coastline  with  harbor 
entrance  closed  and 

c.  A radiated  wave,  0^(x,y),  emanating  from  the  harbor  entrance. 

Physically,  the  wave  regime  in  the  ocean  region  is  composed  of  a steady- 
state  standing  wave  regime  with  a radiated  wave  due  to  the  presence  of 
the  harbor  superimposed  on  it. 

The  boundary  conditions  for  both  regions  are: 

a.  Along  all  solid  boundaries  such  as  the  harbor  boundary  and 
the  coastline,  the  normal  velocity  component  is  zero  or 

O9)  8n(x,y)  _ 

9n  “ U 

b.  Along  the  harbor  entrance,  a "continuity"  or  matching 
condition  is  used  such  that  the  wave  amplitude  and  slope  of 
the  water  surface  obtained  from  the  solution  in  the  ocean 
region  must,  equal  these  quantities  obtained  from  the  solution 
in  the  harbor  region. 

c.  At  the  harbor  entrance  an  outwardly  radiated  wave  is  generated 
by  the  harbor  and  diminishes  in  magnitude  as  the  seaward 
distance  from  the  harbor's  entrance  increases  such  that  at 

an  infinite  distance  from  the  harbor  there  is  no  effect  of 
the  harbor  on  the  wave  regime  in  the  ocean. 

The  latter  condition  allows  some  energy  In  the  harbor  to  be  dissipated 
by  radiating  energy  into  the  ocean,  thus  limiting  the  amplification  of  the 
harbor  response  at  resonance.  This  condition  also  governs  the  selection 
of  the  mathematical  form  of  the  fundamental  solution  of  the  Helmholtz 
equation  in  the  ocean  region  and  likewise  in  the  harbor  region.  The  form 
of  the  fundamental  solution  chosen  is  the  Hankel  function.  H1  (kr) 
where  o 

<4°>  r = JlFTj- 
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This  form  is  chosen  because  it  satisfies  the  Helmholtz  equation,  decays 
to  zero  at  infinity  and  possesses  the  proper  type  of  singularity  at  the 
origin. 


Using  Green’s  identity  formula  and  the  fundamental  solution 
TH  U>(kr)"| 

L o -J  of  the  two-dimensional  Helmholtz  equation,  a solution  for  the 

wave  function,  n(x,y),  at  any  position  in  the  domain  of  interest  is  obtained 
in  integral  form  as  a function  of  the  value  of  ri(x,y)  and  Bp  (x,y) 

at  the  boundary, 


(41)  n(x)  = - £ / { n(XG>  [H^Ckr)]-  H^^kr)^  [n(xG)] } ds(xo) 


This  form  of  solution  for  the  Helmholtz  equation  is  known  as  the  Weber 
solution,  and  its  derivation  is  presented  in  potential  theory  texts  such 
as  Baker  and  Copson  (1950),  To  determine  the  wave  function  at  any  interior 
point  in  the  region  of  interest,  the  value  of  ri(x,y)  and  the  value  of 


on  t he  boundary  of  the  region  are  required,  Lee  and  Raichlen 
(1972)  showed  that  a matrix  approximation  of  an  integral  equation  similar 


in  form  to  equation  (41)  could  be  used  to  obtain  ri(x,y)  along  the  boundary 
in  terms  of  along  the  boundary.  With  the  boundary  condition  that  the 

dn 

normal  derivative  of  the  wave  function  on  the  solid  boundary  is  zero,  the 
wave  function  on  the  boundary  and,  in  turn,  the  wave  function  at  any 
interior  point  can  be  obtained  as  a function  of  the  unknown  normal  derivative 
of  the  wave  function  along  the  entrance  of  the.  harbor.  In  a similar  manner 
as  the  harbor  region,  the  wave  function  at  any  interior  point  of  the  ocean 
region  is  obtained  as  a function  of  the  unknown  normal  derivative  of  the 
wave  function  along  the  harbor  entrance.  Therefore,  by  matching  the 
solution  in  each  region  at  the  harbor  entrance,  the  unknown  normal  derivatives 
of  the  wave  function  at  the  harbor  entrance  can  be  obtained.  Thus,  the 
wave  function  on  the  boundary  can  be  calculated  and  the  wave  function  at 
any  position  in  the  harbor  can  be  determined.  Then,  the  amplification 
factor  for  the  response  of  the  harbor  can  be  calculated  by  taking  the  ratio 
of  the  wave  amplitude  at.  any  position  in  the  harbor  to  the  incident  plus 
reflected  wave  amplitude  at  the  coastline  with  the  entrance  closed. 


Applications  of  Harbor  Oscillation  Models 

During  a physical  long  wave  model  study  at  WES  of  Port  Hueneme  in 
California,  a review  and  evaluation  of  existing  theories  of  wave-induced 
oscillations  of  harbors  were  conducted  with  some  limited  application  to 
Port  Hueneme  of  Lee  and  Raichlen’ s numerical  method  as  well  as  Abel’s 
numerical  procedure  for  natural  modes  of  oscillation.  This  study  allowed 
WES  personnel  an  opportunity  to  become  familiar  with  the  limitations  and 
applicability  of  these  numerical  models  and  to  establish  their  use  as  a 


496 


X^  Xs^  ^ ^^XXc~  X,  XacSsssv^  v 'X^ 

X^xXsv^^s^  x X$^  Xx^^^asts*  ^XXv  Xs^t.  ns^X^lXv.  ^^eC^^Oss^s 


Xv.  X^XaX^^.  "^x®_  X^xX^a_X  Xx^^tXXi  ^ V^v%.  \X\  v *X$s$s^N  X&&. 


<X&.  ^ 

^ V XRi^X 

v SX^ 
fcS0\s^ 

3R£V^ 

^ %ssX^  X^ 

XX,  X^^^^SS?V.^L 
v Xv. 

^ ^dXX^Ji-  X\5£\X0\d^dv 
X"  XXdXXX  >s\  Xs^  x'%sas&5j*: 


NV-^  Xse_x  ^^XX^XX^X  x 

vX\x^  ^>A  Hsss.  Xsssl  XsXx^\.  ^ x%X^\xi^ 

"*XXX  a»  \\sXX^-^v^^X  ^txX^X^  Xx®^  x^\\%  Xv  X®*Xx 

^eXX  x^XXXx,  Xv  Xx^  ^^X\aXx  ^^XXv  Vxv\^  X^X  . Xx^X^xass^feX  ^ 
XL  X^X^X^x  Xs^.  X53&5S*.  xx 

"XXX  Xxs_  ®X  X^  \mxx  xv^Xst^  LX  X^  ^^xxxs^  Xs&ss 

"VX^.  ^xXx^xj^.  Xxr,  x^xaXXa^  X^\  X^  Xv  ^ 

^k  X^l  ^^vXx^X  X\  \X^  XX\XXO\sd^  ^X  m\x\X\^ 

^k  ^ XX.^,  ^vX\  n X^l  ^wX^X  vass\  ^ 

X^^X'x^XXx  >X  %XXX\X^  x^jsssx.  s 

X&®„  ^^aXsXx  xxXLX  Xx  X>x  ax  ^X^^x 

Xx  X^&.  X^xx$^£d®>\  N 

X^  X^xx^x®?\  x%^Xv^v  X^  x^xs&ssx  V« 
^SX^s^  ^xxx.  XXXvX\si5X  xx'^XL'*®^  <cX  Xssx. 

Nx^X*o^®^Xx  Xv  XXss*  ^^LXxLX  "^XXv  Xx  axvs^ 
xk^.  Xv_  xX^LNX\ss^ 


a.^s^as^x  Xxx^s^Xxs*  ^ XXX^ 
V XX  XsXXs^  X^^  ^Xv*XX\^ 
vxa^Xx^*  a^dX  a^a^s^xXx^a*  Xv 
^.X^X  XX^X\  X^.XxXx  X^s^v 
XvV\\ 


\X  X^e^  ^m\vm.  > X^x  ^ 

^X\\^  ^ a^eX  ^X  x^x^  S^ 

^X^xX^X^x  ^X  XVXm  . 

X^^fe.  ^\mXX\wXxXX 

X^sss  ^ X^  X^_  X^\X\^ 


X^X^v  X X^XiS^  XXv^ 

^ ^xxx  xx\  *^xxxx 

^>X5N  ^ Xs^^  XsxXX^ 

^ ^GfcXx^rXv.  ^rsXx 

€C\  X^V  XXX^^  XX^  X^sl.  X^- 

Xv  ^XX&xX^l  ^^XXvXd^^  \5d 

^.:XvKm  ■^v^^j^dX  Xs\^  XX^  ^f^L1^.  XvXX\5^  Xx  - 


X^v.  ^XdXSXX^dsX. 

x?e^^x  ^odX\  ^ x^^xxsx 

^XXv 

V Xv  X^ft. ! 

X^  TX\Xw^ 
X^SS^XXX  ^X&\  - 


XXXX,  ^sSXSXXX^  . XX?o^^_  ^^sXNs^ 

CX  X^sX^X^.  X^^.  ^ ^ x^ 

Xv  ^ XvcCCXs^  ^^^xssss^^  X^  ^tX5^X  X?dSd, 
dt^Xv*.  Xs&^u  X\^\\_%  XSi  XX\S5rV. 


^.h  \XsXv,  Xv^^XX^ddv^e^  X^.  X^t  X ^>^-^XX\^_  X^^Xdvv  ^5^  <XX^  Xc^_ 

XXX^r^vv  ^^ddX  Xv  X^c^as^X^ 

Xdv  , ^XXX^Xxr^  ^ 

V aaX  ^v  ^XXvN^.  W XX^ 

^ X^XX^  x^.  X^  XX®^  ^X^X\ . 


X\  v X^XXX.  v^Xv  X^^dd^/^XX  X&:^X'i3^  X^av  a&j&sssj^  Nx^XXv.  X^ 

X^®d.  Xv^X^  Xv  XXdXv  W^sj^  ^ ^sd^X'dv  N ^Vd,  Xdv  X<XXX^d^ 

X^  ^:d^X\XeX^^  Xtt^XaX\5^s  X^iXdXXX  ^ ^ \^\^m_^\Xd^ 

%xXXXs^  >^X  ‘^XCX  Xv^Kd 


XX  * X^X  Xa&&2i£SS®u 


SEICHE  RELATIVE  AMPLITUDE  COMPARISON 


LU 

h- 

CM 

O) 

CO 

CO 

3 

o> 

cr> 

ZD 

CO 

CO 

CO 

CD 

o 

o 

1 

CD 

1 

CD 

i 

1 

1 

1 

CD 

1 

i 

CM 


t iO 
ir>  co 


VO 

CD 

O 


(rt  o O <J>  co  uo  CM 

(0  (0  tp  l/1  *H  CM  CM  If) 


m 

<r> 


CD 

i 


O 

o 


C£ 

O 

GQ 


QC 

LU 


O o ^ o 

-J  o os  Q 


QC 

O ZD 
. O-  C 

yj  be  95  !±J 


U.  0£  z OC 

30D0flS0<DC_J 
CDQ_OQL.LUQ_Li_UJO 


o O LU 
Z Q O 

o > — LU  QS 
OC  QC  J Z 

o o o o 

X CL  h Z 


503 


FROM  PLATZMAN  AND  RAO  (1963) 


included  in  the  model*  Various  applications  to  actual  inlets 
will  be  made  within  calendar  year  1975* 

c*  In  addition  to  the  models  now  being  applied  in  the  Lake.  Erie 
Jetport  study,  a time-dependent,  rigid  lid  model  including 
non-linear  convection,  horizontal  diffusion,  variable  density, 
and  coupling  with  the  energy  equation,  will  be  operationalized 
at  WES.  Developed  by  Paul  and  Lick  (1973),  the  model  has 
been  used  to  study  the  flow  from  the  Cuyahoga  River  into  Lake 
Erie  under  conditions  such  that  the  river  water  is  either  at 
the  same  temperature,  warmer,  or  cooler  than  lake  water.  The 
model  is  presently  being  used  by  Paul  and  Lick  to  study  the 
thermal  efflux  from  power  plants  and  is  also  being  applied 
by  Gedney,  Molls  and  Paul  to  a study  of  the  overall  cir- 
culation in  Lake  Erie  under  stratified  conditions. 

d.  A wave  information  program  is  currently  underway  at  WES  and 
its  primary  objective  is  to  produce  a wave  climate  on  the 
Great  Lakes  for  application  to  the  design  of  dredged  material 
retaining  structures.  Specification  of  the  wave  climate  will 
include:  1)  probabilities  of  wave  heights,  wave  periods,  and 

approach  directions  at  the  edge  of  the  breaker  zone;  2)  estimates 
of  recurrence  intervals  for  large  waves;  and  3)  estimates  of 
probable  errors  associated  with  hindcasted  wave  heights. 

Several  numerical  hindcast  models  will  be  tested  for  accuracy 
against  recorded  wave  spectra  around  the  Great  Lakes. 

These  models  compute  the  entire  five-dimensional  energy  spectrum 
of  waves  ( f (x , y , a),  g,  t ) ),  With  the  numerical  models,  the 
effects  of  bottom  topography  on  wave  propagation  and  energy 
loss  due  to  the  interaction  of  waves  with  the  bottom  can  be 
computed.  Also  included  in  the  models  are  the  effects  of 
fetch  length  and  width  limitations  and  the  effect  of  finite 
wind  durations.  The  development  of  the  models  has  been 
carried  out  primarily  by  groups  of  researchers  at  Scripps 
Institute  of  Oceanography,  New  York  University  and  Westing- 
house  Ocean  Research  Laboratory.  For  some  details  of  the 
concepts  of  these  models,  the  reader  is  referred  to  Barnett 
(1966)  and  Inoue  (1966). 

10.  SUMMARY.  To  recap,  a broad  spectrum  of  numerical  models,  based 
on  long  wave  theory  approximations,  has  been  presented.  These  models  are 
all  presently  being  used  and  maintained  at  WES.  A continuing  effort  to 
improve  the  models  is  also  a major  objective.  WES  has  always  played  the 
major  role  in  the  United  States  in  regards  to  physical  models  of  hydraulic 
systems,  whether  they  be  harbors,  inlets,  rivers,  spillways,  etc.  With 
numerical  modeling  now  being  recognized  as  an  important  tool  in  the  investi- 
gation of  hydraulic  processes,  it  becomes  imperative  for  WES  to  remain 
abreast  of  the  state-of-the-art  practices  in  such  efforts.  Throughout  the 
descriptions  of  applications  of  the  various  models , the  authors  have  tried 
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to  stress  the  importance  of  using  numerical  models  as  a complementary 
activity  when  conducting  hydraulic  investigations. 

An  important  consideration  to  keep  in  mind  is  that  both  numerical 
and  physical  models  each  have  basic  limitations  and  strengths*  In 
particular,  the  assumptions  and  constraints  inherent  in  each  model  must 
be.  cautiously  considered  in  the  application  of  numerical  models  of  long- 
waves  and  wind-driven  circulations*  Both  the  selection  of  the  prototype 
conditions  being  modeled  as  well  as  the  interpretation  of  the  results 
must  be  reasonably  consistent  with  the  model  assumptions. 
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NOTATIONS 


A Surface  area  of  study  region 

q Frictional  coefficient 

C Drag  coefficient  for  calculation  of  wind  stress 
d 

E Eddy  diffusivity  coefficient 
v 

f Coriolis  parameter 

g Acceleration  of  gravity 

H ^ Zero  order  Hankel  function  of  the  first  kind 

o 

Subscripts  used  to  denote  spatial  coordinates 
K Wind  stress  coefficient  in  boundary  conditions 

r Wave  number 

L L Landslide  force  components 

x’  y 

n Superscript  used  to  denote  a time  instant 

n Refers  to  a normal  line 

r ,r  Components  of  forcing  terms  in  Landslide  Model  — 
x y forces  due  to  slide 

r Distance  function 

r Radius  of  earth 

e 

S Distance 

t Time 

U9V  Integrated  velocity  components 

u*v,w  Three-dimensional  particle  velocity  components 

V V Components  of  landslide  velocity 
x*  y 

W Wind  speed 

a 

v 7 Cartesian  coordinated 
x > y 5 ^ 


excluding 
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NOTATIONS  CONTINUED 


Viscous  drag  parameter 

Pressure  drag  parameter 

Coefficients  in  stream  function  equation 

Interval  operator 

Ground  motion  parameter 

Surface  elevation 

Latitudinal  coordinate 

Eigenvalue 

Density  of  air 

Bottom  stress  vector 

Wind  stress  vector 

Longitudinal  coordinate 

Varitional  integral 

Stream  function 

Frequency 


procedures  for  data  collection  and  analysis.  For  example,  the  Los  Angeles 
and  Long  Beach  Harbors  mode]  (Whalin  et  ai,  1974)  covers  an  area  of  45,930 
square  feet.  Considerable  effort  over  the  past  several  years  has  been 
devoted  to: 

a.  Design  and  development  of  automated  systems  for  acquisition 
of  wave  data  and  for  model  controls. 

b.  The  automation  of  data  reduction  and  analyses* 

There  are  presently  three  automated  systems  for  hydraulic  models 
in  the  Hydraulics  Laboratory.  One  system,  (Daggett,  1971)  developed  a few 
years  ago,  is  operational  on  the  New  York  Harbor  model  in  the  Estuaries 
Division  of  the  Hydraulics  Laboratory.  The  other  two  systems,  which  are. 
the  subject  of  this  paper,  are  operational  on  various  wave  models  in  WDD . 

The  need  for  automated  model  procedures  and  the  success  of  the  above 
automated  systems  have  led  to  future  plans  for  providing  model  automation 
to  most  modeling  facilities  of  the  Hydraulics  Laboratory.  Such  plans  are 
basically  to  extend  the  automated  capabilities  of  the  above  systems  to 
other  facilities  and/or  installation  of  additional  automated  systems* 

2 » SYSTEM  C ON  F I Gil  RAT  LON.  The  automated  systems  for  wave  models 

have  been  given  the  title  "Automated  Data  Acquisition  and  Control  System" 
(whose  acronym  is  ADACS)  and  have  two  primary  functions:  (1)  automated 

acquisition  of  wave  data  in  a format  (magnetic  tape  or  disc)  compatible 
for  computer  reduction  and  analyses  and  (2)  automated  control  of  wave 
sensor  calibrations  and  of  the  wave  generators.  The  system  configuration 
(Fig.  2)  of  ADACS  consists  of  the  following  subsystems: 

a.  Digital  Data  recording  and  controls. 

b.  Analog  recorders  and  channel  selection  circuits. 

c.  Wave  sensors  and  interfacing  equipment. 

d.  Wave  generators  and  control  equipment. 

The  digital  data  recording  and  control  subsystem  is  basically  a 
minicomputer  (one  psec  memory  cycle  time)  with  the  following  characteristics 
and  peripheral  devices: 

a*  32K,  16  bit  words  of  core  memory. 

b.  Analog  to  digital  pack  featuring  64  analog  inputs  (+10  volts, 
FS),  45  KHz  multiplexer  and  12  bit  (including  sign)  A/D 
converter. 

c.  Interval  timer  with  a one  psec  counter. 
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Fig.  2.  Schematic  of  components  of  automated  data 
acquisition  and  control  system 
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AUTOMATED  DATA  ACQUISITION  AND  CONTROL  SYSTEM  (ADACS) 
FOR  HYDRAULIC  WAVE  AND  TIDAL  INLET  MCOELS 
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be  moved  vertically  a precise  distance*  The  electric  motor  for  each 
wave,  sensor  is  controlled  by  a control /.sense  line  and  a relay  contact. 

The  minicomputer  controls  the  vertical  movement  of  each  wave  sensor  by 
actuating  the  control/sense  line.  The  central  processing  unit  (cpu) 
acts  as  a voltage  comparator  by  monitoring  the  potentiometer  voltage  and 
comparing  it  to  a reference  voltage  which  is  determined  from  desired 
displacement  and  potentiometer  calibration.  When  the  voltage  comparison 
is  satisfied,  the  control/sense  line  is  reactivated;  the  electric  motor 
stops;  and  voltage  samples  from  the  rods  and  potentiometers  are  acquired. 

By  systematically  moving  each  wave  sensor  through  11  quasi -equally 
spaced  locations  over  the  range  of  the  rod  length  used,  voltage  versus 
known  displacements  are  obtained  from  which  a calibration  curve  for  each 
sensor  can  be  calculated  and  recorded  on  magnetic  tape  or  disc.  After 
collecting  the  calibration  data,  the  minicomputer  analyzes  these  data  by 
least  squares  fitting  a set  of  curves  (linear,  quadratic,  or  spline)  to 
the  data,  determining  the.  best  order  of  fit,  and  comparing  the  maximum 
deviation  of  the  best  fit  to  a previously  acceptable  value  for  this 
maximum  deviation.  If  the  fitted  curves  are  not  acceptable,  the  mini- 
computer flags  that  channel  in  the  calibration  record  on  the  magnetic 
tape  or  disc  for  further  analyses.  Any  malfunctioning  sensors  are  listed 
on  the  teletype  for  the  operator  to  determine  the  required  action  (i.e., 
accept  present  cal  ibration,  clean  bad  rods,  recalibrate, . etc: .) . Having 
completed  the  calibration  inode,  the  original  calibration  data  for  each 
set  of  parallel  rods  and  potentiometer  as  well  as  the  calibration 
coefficients  are  written  into  a file  on  magnetic  tape  or  disc.  Analytical 
considerations  of  calibration  curve  fitting,  accuracy  of  reference 
potentiometers  and  repeatability  test  of  calibration  procedures  have 
demonstrated  that  an  accuracy  of  +0.001  feet  in  wave  height  is  obtainable. 

5 . DATA  ACQUISITION.  During  the  acquisition  mode  wave  data  for  a 
specified  wave  "condition  at  the  wave  generator  are  collected  from  a 
maximum  of  50  wave  sensors,  recorded  on  analog  strip  charts,  digitized  and 
recorded  on  magnetic  tape  or  disc  for  further  analyses.  The  sampling 
scheme  is  quite  flexible  and  can  be  tailored  for  different  applications  with 
maximum  thru~put,  rates  theoretically  limited  by  the  multiplexer  rate 
(45KHz)  and  allocatable  buffer  size.  However,  for  specific  types  of 
tests  the  execution  time  of  applications  software  and  the  specified  number 
of  discrete  samples  per  wave  period  degrade  the  about  maximum  thru-put 
rate.  The  present  sampling  scheme  used  for  each  wave  sensor  is  60  discrete 
voltage  samples  equally  spaced  over  each  wave  period  for  a predetermined 
number  of  periods  (normally  24).  The  minicomputer  calculates  from  input 
parameters  the  required  timing  interval  between  multiplexer  scans  to 
provide  the  correct  sampling  rate  and  Initializes  counters  for  determining 
completion  of  wave,  tests.  In  addition,  it  controls  the  starting  of  the 
wave  generators,  lags  the  beginning  of  data  acquisition  by  a specified 
number  of  wave  periods  after  starting  the  generators,  and  provides  timing 
pulses  for  synchronizing  and  controlling  the  analog  recorders.  At  com- 
pletion of  the  acquisition  mode,  the  calibration  data  and  wave  data  have 
been  recorded  in  both  analog  and  digital  form.  In  binary  form  on  magnetic 
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tape  or  disc,  these  data  with  a header  for  test  identification  and 
pertinent  parameters  are  available  from  disc  for  direct  analyses  by  the 
minicomputer  or  on  magnetic  tape  in  a format  suitable  for  backup  analyses 
on  WES  Honeywell  G6  3 3 computer. 

DATA  ANALYSIS.  The  analysis  of  the  data  may  be  performed  by 
either  the  minicomputer  or  by  a Honeywell  C635  of  the  Automated  Data 
Processing  Center  at  WES.  Schematically,  the  procedure  for  data  analyses 
is  as  follows: 


a . Progr am  Initial  1 z a t i o n 

(1)  Input  test  parameters  and  option  flags. 

(2)  Read  and  decode  data  tape  or  disc  file. 

(3)  Demultiplex  data  files. 

b.  Wave  Record  Analyses 
(1)  H + tf  and  T _+  Grf. 

<2>  >w 

(3)  H where  x is  a specified  percent  of  the  highest 
wave  heights,  normally  x = .333;  thus  H^  y ^ is  signi- 
ficant wave  height. 

(4)  Option  to  plot  wave  heights  versus  time. 

c.  Fourier  Analyses 


(1) 

Autospectrum 

(a) 

Amplitude- frequency 

(b) 

Energy 

(c) 

Spectral  smoothing 

(d) 

Plots  of  above  parameters 

(2) 

Cross  spectrum 

(a) 

Energy 

(b) 

Coherency 

(c) 

Phase 

<d) 

Plots  of  above  parameters 
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The  results  of  the  data  analyses  can  be  plotted  by  pen  plotter  or  CRT 
plotter  (microfilm  and  hard  copy)  on  Honeywell  G635  system  and  by  electro- 
static printer  on  minicomputer.  For  each  model  study,  the  original  data 
and  analyzed  results  of  all  tests  are  permanently  stored  on  magnetic  tape 
for  future  reference.  With  these  analyses  of  wave  records  from  various 
harbor  locations,  the  amplification  factors  or  harbor  response  for  various 
input  wave  conditions  are  estimated.  These  results  provide  the  hydraulic 
engineers  with  the  basic  data  whereby  the  effects  of  various  proposed 
expansions  and  modifications  to  an  existing  harbor  can  be  evaluated. 

7 . SUMMARY . Automated  control  of  the  calibration  of  wave  sensors 
has  reduced  the  time  required  to  calibrate  sensors  by  a factor  of  four. 

In  addition,  at  least  twice  the  number  of  model  tests  can  be  run  during 
a day.  The  cost  for  data  analyses  has  decreased  by  at  least  a factor  of 
two  with  test  results  returned  to  the  project  engineer  immediately  after 
model  testing  when  analyzed  by  minicomputer  subsystem  and  over  night  when 
analyzed  by  backup  mode  with  Honeywell  G635  system.  Thus,  the  automated 
procedures  described  in  this  paper  have  enhanced  the  modeling  capabilities 
and  increased  the  efficienty  of  modeling  procedures.  Future  efforts 
include  an  evaluation  program  for  wave  and  tidal  height  sensors,  velocity 
meters,  spectral  wave  generation,  and  an  expansion  of  automated  capabilities 
to  other  model  facilities.* 
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